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Preface 


The world is continuous, but the mind is discrete. 


David Mumford 


The mathematical interplay between polytopes and lattices comes to life when 
we study the relationships between the continuous volume of a polytope and 
its discrete volume. Since the humbling and positive reception of the first 
edition of this book, published in 2007, the field of integer-point enumeration 
in polyhedra has gained considerable momentum. Many fields of mathematics 
have begun to interact in even more surprising ways, and a beautifully simple 
unification among a multitude of classical problems continues to emerge when 
we use this combinatorial-geometric lens. 

In this second edition, and with encouragement from many wonderful 
readers, we have added two new chapters: Chapter 9 introduces zonotopes, 
an extremely useful class of polytopes, and Chapter 10 explores some deep 
yet elegant relationships that are satisfied by the h-polynomials and h*- 
polynomials. We have also added many new exercises, new and updated open 
problems, and graphics. 

Examples of polytopes in three dimensions include crystals, boxes, tetra- 
hedra, and any convex object whose faces are all flat. It is amusing to see 
how many problems in combinatorics, number theory, and many other mathe- 
matical areas can be recast in the language of polytopes that exist in some 
Euclidean space. Conversely, the versatile structure of polytopes gives us 
number-theoretic and combinatorial information that flows naturally from 
their geometry. 

The discrete volume of a body P can be described intuitively as the number 
of grid points that lie inside P, given a fixed grid in Euclidean space. The 
continuous volume of P has the usual intuitive meaning of volume that we 
attach to everyday objects we see in the real world. 
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Fig. 0.1 Continuous and discrete volume. 


Indeed, the difference between the two realizations of volume can be 
thought of in physical terms as follows. On the one hand, the quantum-level 
grid imposed by the molecular structure of reality gives us a discrete notion of 
space and hence discrete volume. On the other hand, the Newtonian notion of 
continuous space gives us the continuous volume. We see things continuously at 
the Newtonian level, but in practice, we often compute things discretely at the 
quantum level. Mathematically, the grid we impose in space—corresponding 
to the grid formed by the atoms that make up an object—helps us compute 
the usual continuous volume in very surprising and charming ways, as we 
shall discover. 

In order to see the continuous/discrete interplay come to life among the 
three fields of combinatorics, number theory, and geometry, we begin our focus 
with the simple-to-state coin-exchange problem of Frobenius. The beauty 
of this concrete problem is that it is easy to grasp, it provides a useful 
computational tool, and yet it has most of the ingredients of the deeper 
theories that are developed here. 

In the first chapter, we give detailed formulas that arise naturally from the 
Frobenius coin-exchange problem in order to demonstrate the interconnections 
between the three fields mentioned above. The coin-exchange problem provides 
a scaffold for identifying the connections between these fields. In the ensuing 
chapters, we shed this scaffolding and focus on the interconnections themselves: 


(1) Enumeration of integer points in polyhedra—combinatorics, 
(2) Dedekind sums and finite Fourier series—number theory, 
(3) Polygons and polytopes—geometry. 


We place a strong emphasis on computational techniques and on computing 
volumes by counting integer points using various old and new ideas. Thus, 
the formulas we get should not only be pretty (which they are!) but also allow 
us to compute volumes efficiently using some nice functions. In the very rare 
instances of mathematical exposition when we have a formulation that is both 
“easy to write” and “quickly computable,” we have found a mathematical 
nugget. We have endeavored to fill this book with such mathematical nuggets. 

Much of the material in this book is developed by the reader in the more 
than three hundred exercises. Most chapters contain warmup exercises that 
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do not depend on the material in the chapter and can be assigned before the 
chapter is read. Some exercises are central, in the sense that current or later 
material depends on them. Those exercises are marked with &, and we give 
detailed hints for them at the end of the book. Most chapters also contain 
lists of open research problems. 

It turns out that even a fifth grader can write an interesting paper on integer- 
point enumeration [192], while the subject lends itself to deep investigations 
that attract the current efforts of leading researchers. Thus, it is an area of 
mathematics that attracts our innocent childhood questions as well as our 
refined insight and deeper curiosity. The level of study is highly appropriate 
for a junior/senior undergraduate course in mathematics. In fact, this book 
is ideally suited to be used for a capstone course. Because the three topics 
outlined above lend themselves to more sophisticated exploration, our book 
has also been used effectively for an introductory graduate course. 

To help the reader fully appreciate the scope of the connections between 
continuous volume and discrete volume, we begin the discourse in two dimen- 
sions, where we can easily draw pictures and quickly experiment. We gently 
introduce the functions we need in higher dimensions (Dedekind sums) by 
looking at the coin-exchange problem geometrically as the discrete volume of 
a generalized triangle, called a simplex. 

The initial techniques are quite simple, essentially nothing more than 
expanding rational functions into partial fractions. Thus, the book is easily 
accessible to a student who has completed a standard college calculus and 
linear algebra curriculum. It would be useful to have a basic understanding of 
partial fraction expansions, infinite series, open and closed sets in R?, complex 
numbers (in particular, roots of unity), and modular arithmetic. 

An important computational tool that is harnessed throughout the text 
is the generating function f(x) = >) a(m) x™, where the a(m)’s form a 
sequence of numbers that we are interested in analyzing. When the infinite 
sequence of numbers a(m), m = 0,1,2,..., is embedded into a single generat- 
ing function f(x), it is often true that for hitherto unforeseen reasons, we can 
rewrite the whole sum f(x) in a surprisingly compact form. It is the rewriting 
of these generating functions that allows us to understand the combinatorics 
of the relevant sequence a(m). For us, the sequence of numbers might be the 
number of ways to partition an integer into given coin denominations, or the 
number of points in an increasingly large body, and so on. Here we find yet 
another example of the interplay between the discrete and the continuous: we 
are given a discrete set of numbers a(m), and we then carry out analysis on 
the generating function f(x) in the continuous variable x. 


What Is the Discrete Volume? 


The physically intuitive description of the discrete volume given above rests 
on a sound mathematical footing as soon as we introduce the notion of a 
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lattice. The grid is captured mathematically as the collection of all integer 
points in Euclidean space, namely Z4 = {(x1,...,2q@) : all a, € Z}. This 
discrete collection of equally spaced points is called a lattice. If we are given a 
geometric body P C R%, its discrete volume is simply defined as the number 
of lattice points inside P, that is, the number of elements in the set Z¢N P. 

Intuitively, if we shrink the lattice by a factor k and count the number 
of newly shrunken lattice points inside P, we obtain a better approximation 
for the volume of P, relative to the volume of a single cell of the shrunken 
lattice. It turns out that after the lattice is shrunk by an integer factor k, the 
number # (P al ¢Z*) of shrunken lattice points inside an integral polytope P 
is magically a polynomial in k. This counting function # (P a ¢Z*) is known 
as the Ehrhart polynomial of P. If we kept shrinking the lattice by taking a 
limit, we would of course end up with the continuous volume that is given by 
the usual Riemannian integral definition of calculus: 


. 1_a\ 1 
vol P = im # (Pr Ze ) Ea: 
However, pausing at fixed dilations of the lattice gives surprising flexibility 
for the computation of the volume of P and for the number of lattice points 
that are contained in P. 

Thus, when the body P is an integral polytope, the error terms that measure 
the discrepancy between the discrete volume and the usual continuous volume 
are quite nice; they are given by Ehrhart polynomials, and these enumeration 
polynomials are the content of Chapter 3. 


The Fourier—Dedekind Sums Are the Building Blocks: Number 
Theory 


Every polytope has a discrete volume that is expressible in terms of certain 
finite sums that are known as Dedekind sums. Before giving their definition, 
we first motivate these sums with some examples that illustrate their building- 
block behavior for lattice-point enumeration. To be concrete, consider, for 
example, a 1-dimensional polytope given by an interval P = [0,a], where a is 
any positive real number. It is clear that we need the greatest integer function 
|x| to help us enumerate the lattice points in P, and indeed, the answer is 
la] +1. 

Next, consider a 1-dimensional line segment that is sitting in the 2- 
dimensional plane. Let’s choose our segment P so that it begins at the origin 
and ends at the lattice point (c,d). As becomes apparent after a moment’s 
thought, the number of lattice points on this finite line segment involves an 
old friend, namely the greatest common divisor of c and d. The exact number 
of lattice points on the line segment is gcd(c, d) + 1. 
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To unify both of these examples, consider a triangle P in the plane whose 
vertices have rational coordinates. It turns out that a certain finite sum is 
completely natural because it simultaneously extends both the greatest integer 
function and the greatest common divisor, although the latter is less obvious. 
An example of a Dedekind sum in two dimensions that arises naturally in the 
formula for the discrete volume of the rational triangle P is the following: 


oat) = (3) (M2 |] 2). 


m=1 


The definition makes use of the greatest integer function. Why do these 
sums also resemble the greatest common divisor? Luckily, the Dedekind sums 
satisfy a remarkable reciprocity law, quite similar to the Euclidean algorithm 
that computes the greatest common divisor. This reciprocity law allows the 
Dedekind sums to be computed in roughly log(b) steps rather than the b steps 
that are implied by the definition above. The reciprocity law for s(a, b) lies 
at the heart of some amazing number theory that we treat in an elementary 
fashion, but that also comes from the deeper subject of modular forms and 
other modern tools. 

We find ourselves in the fortunate position of viewing an important summit 
of an enormous mountain of ideas, submerged by the waters of geometry. As 
we delve more deeply into these waters, more and more hidden beauty unfolds 
for us, and the Dedekind sums are an indispensable tool that allow us to see 
farther as the waters get deeper. 


The Relevant Solids Are Polytopes: Geometry 


The examples we have used, namely line segments and polygons in the plane, 
are special cases of polytopes in all dimensions. One way to define a polytope 
is to consider the convex hull of a finite collection of points in Euclidean 
space R?. That is, suppose someone gives us a set of points v,,...,Vn in 
R?. The polytope determined by the given points v,; is defined by all linear 
combinations cj v1 +¢2V2+--::+¢nVn, where the coefficients c; are nonnegative 
real numbers that satisfy the relation cy + cg +---+c, = 1. This construction 
is called the vertex description of the polytope. 

There is another equivalent definition, called the hyperplane description 
of the polytope. Namely, if someone hands us the linear inequalities that 
define a finite collection of half-spaces in R?, we can define the associated 
polytope as the simultaneous intersection of the half-spaces defined by the 
given inequalities. 

There are some “obvious” facts about polytopes that are intuitively clear 
to most students but are, in fact, subtle and often nontrivial to prove from 
first principles. One of these facts, namely that every polytope has both a 
vertex and a hyperplane description, forms a crucial basis to the material we 
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will develop in this book. We carefully prove this fact in the appendix. The 
statement is intuitively clear, so that novices can skip over its proof without 
any detriment to their ability to compute continuous and discrete volumes 
of polytopes. All theorems in the text (including those in the appendix) are 
proved from first principles, with the exception of Chapter 14, where we 
assume basic notions from complex analysis. 


Chapter 1 
The Coin-Exchange 
Problem of Frobenius 


\ ie 


Chapter 3 
Counting Lattice Points in 
Polytopes: The Ehrhart Theory 


Chapter 2 
A Gallery of Discrete Volumes 
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Dehn-Sommerville Relations 
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Chapter 13 


Euler—MacLaurin Solid Angles 


Summation in R¢@ 


Fig. 0.2 The partially ordered set of chapter dependencies 


The text naturally flows into two parts, which we now explicate. 
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Part I 


We have taken great care in making the content of the chapters flow seamlessly 
from one to the next, over the span of the first six chapters. 


Chapters 1 and 2 introduce some basic notions of generating functions, in 
the visually compelling context of discrete geometry, with an abundance of 
detailed motivating examples. 

Chapters 3, 4, and 5 develop the full Ehrhart theory of discrete volumes of 
rational polytopes. 

Chapter 6 is a “dessert” chapter, in that it enables us to use the theory 
developed to treat the enumeration of magic squares, an ancient topic that 
enjoys active current research. 


Part IT 


We now begin anew. 


Having attained experience with numerous examples and results about 
integral polytopes, we are ready to learn about the Dedekind sums of Chap- 
ter 8, which form the atomic pieces of the discrete-volume polynomials. On 
the other hand, to fully understand Dedekind sums, we need to understand 
finite Fourier analysis, which we therefore develop from first principles in 
Chapter 7, using only partial fractions. 

In Chapter 9, we study a concrete class of polytopes—projections of cubes, 
which go by the name zonotopes—whose discrete volume is tractable and 
has neat connections to number theory and graph theory. 

Chapter 10 develops inequalities among the coefficients of an Ehrhart poly- 
nomial, based on a polynomial decomposition formula that arises naturally 
from the arithmetic and combinatorial data of triangulations. 

Chapter 11 answers a simple yet tricky question: how does a finite geometric 
series in one dimension extend to higher-dimensional polytopes? Brion’s 
theorem gives an elegant and decisive answer to this question. 

In Chapter 12, we extend the interplay between the continuous volume and 
the discrete volume of a polytope (already studied in detail in Part I) by 
introducing Fuler-Maclaurin summation formulas in all dimensions. These 
formulas compare the continuous Fourier transform of a polytope to its 
discrete Fourier transform, yet the material is completely self-contained. 
Chapter 13 develops an exciting extension of Ehrhart theory that defines 
and studies the solid angles of a polytope; these are the natural extensions 
of 2-dimensional angles to higher dimensions. 

Finally, we end with another “dessert” chapter that uses complex-analytic 
methods to find an integral formula for the discrepancy between the discrete 
and continuous areas enclosed by a closed curve in the plane. 
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Because polytopes are both theoretically useful (in triangulated manifolds, 
for example) and practically essential (in computer graphics, for example) we 
use them to link results in number theory and combinatorics. Many research 
papers have been written on these interconnections, and it is impossible to 
capture them all here, especially since some are being written even as we are 
writing this sentence! However, we hope that these modest beginnings will 
give the reader who is unfamiliar with these fields a good sense of their beauty, 
inexorable connectedness, and utility. We have written a gentle invitation to 
what we consider a gorgeous world of counting and of links between the fields 
of combinatorics, number theory, and geometry for the general mathematical 
reader. 

There are a number of excellent books that have a nontrivial intersection 
with ours and contain material that complements the topics discussed here. 
We heartily recommend the monographs of Barvinok [20,21] (on general 
convexity topics and Ehrhart theory), Bruns—Gubeladze [73] (on commutative 
algebra and K-theory connected with polytopes), De Loera~Hemmecke—K6ppe 
[95] (on connections to optimization), De Loera~-Rambau-Santos [98] (on 
triangulations of point configurations), Ehrhart [111] (the historic introduction 
to Ehrhart theory), Ewald [112] (on connections to algebraic geometry), 
Hibi [134] (on the interplay of algebraic combinatorics with polytopes), Miller— 
Sturmfels [177] (on combinatorial commutative algebra), and Stanley [230] 
(on general enumerative problems in combinatorics). 
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Part I 


The Essentials of Discrete Volume 
Computations 


Chapter 1 


The Coin-Exchange Problem of 
Frobenius 


The full beauty of the subject of generating functions emerges only from tuning in on both 
channels: the discrete and the continuous. 


Herbert Wilf [252] 


Suppose we are interested in an infinite sequence of numbers (a,);-) that 
arises geometrically or recursively. Is there a “good formula” for az, as a 
function of k? Are there identities involving various a,;’s? Embedding this 
sequence into the generating function 


F@)= >" a2" 


k>0 


allows us to retrieve answers to the questions above in a surprisingly quick 
and elegant way. We can think of F(z) as lifting our sequence a, from its 
discrete setting into the continuous world of functions. 


1.1 Why Use Generating Functions? 


To illustrate these concepts, we warm up with the classic example of the 
Fibonacci sequence f;, named after Leonardo Pisano Fibonacci (c. 1170- 
c. 1250)! and defined by the recurrence relation 


fo = 9, fi =1, and fetro = fesi t+ fr for k>0. 


This gives the sequence (fx)p-p = (0,1, 1, 2, 3,5, 8, 18, 21, 34,...) (see also [1, 
Sequence A000045]). Now let’s see what generating functions can do for us. 


1 For more information about Fibonacci, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Fibonacci.html. 
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Let 
F(z) := S- fr2*. 


k>0 


We embed both sides of the recurrence identity into their generating functions: 


S> fet2 z= S$" (feria t fe) 2* => fess i ae (1.1) 


k>0 k>0 k>0 k>0 


The left-hand side of (1.1) is 


Yo fee? = fae = SY feat = SFR) -9), 


k>0 k>0 k>2 


while the right-hand side of (1.1) is 


fess 2 + Do fer = SF) + FQ). 


k>0 k>0 


So (1.1) can be restated as 


5 (F(2)— 2) = SF) + FO), 
PO =e 


It is fun to check (e.g., with a computer) that when we expand the function 
F into a power series, we indeed obtain the Fibonacci numbers as coefficients: 


z 


i f= ec 274223 4324452548 2941327 +2122 4+34294---. 
—Z-2 


Now we use our favorite method of handling rational functions: the partial 
fraction expansion. In our case, the denominator factors as 1 — z — z? = 
(1 1+v5 z) (1 1-v5 2), and the partial fraction expansion is (see Exer- 


cise 1.1) 


7 Hie ie (1.2) 
l=2=27 4 14+V5 » 1 Ee , 


The two terms suggest the use of the geometric series 


‘il 
eS er (1.3) 


k>0 


(see Exercise 1.2) with « = 14V5 zand x= 1-5 z, respectively: 
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Comparing the coefficients of z* in the definition of F(z) = 7,39 fe 2" and 
the new expression above for F'(z), we discover the closed-form expression 


k k 
Ao (it v5 1 /1-v5 
"V5 \ 2 v5\ 2 
for the Fibonacci sequence. 
This method of decomposing a rational generating function into partial 
fractions is one of our key tools. Because we will use partial fractions time 


and again throughout this book, we record the result on which this method is 
based. 


Theorem 1.1 (Partial fraction expansion). Given a rational function 


To p(2) 
oe ea)" 


where p is a polynomial of degree less than e; +e2 +--:+€m and the az are 
distinct complex numbers, there exists a decomposition 


m 
Ck Ck,2 Ck,ex, 
ree) = 5 (4 oo att), 


2 
k=1 (z — ax) 


where cry € C are unique. 


One possible proof of this theorem is based on the fact that the polynomials 
form a Euclidean domain. For readers who are acquainted with this notion, 
we outline this proof in Exercise 1.37. 


1.2 Two Coins 


Let’s imagine that we introduce a new coin system. Instead of using pennies, 
nickels, dimes, and quarters, let’s say that we agree on using 4-cent, 7-cent, 
9-cent, and 34-cent coins. The reader might point out the following flaw of this 
new system: certain amounts cannot be obtained (that is, created with the 
available coins), for example, 2 or 5 cents. On the other hand, this deficiency 
makes our new coin system more interesting than the old one, because we 
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can ask the question, “which amounts can be obtained?” In fact, we will 
prove in Exercise 1.20 that there are only finitely many integer amounts 
that cannot be obtained using our new coin system. A natural question, 
first tackled by Ferdinand Georg Frobenius (1849-1917),? and James Joseph 
Sylvester (1814-1897)? is, what is the largest amount that cannot be changed? 
As mathematicians, we like to keep questions as general as possible, and so 


we ask, given coins of denominations a1, da2,...,@q that are positive integers 
without any common factor, can you give a formula for the largest amount 
that cannot be obtained using the coins a), @2,...,aq? This problem is known 


as the Frobenius coin-exchange problem. 
To be precise, suppose we are given a set of positive integers 


A= {a1,@2,...,@a} 


with gcd (a1, a2,...,@q¢) = 1. We call a positive integer n representable by 
the set A if there exist nonnegative integers m,,mg2,...,Mq such that 


nm=mMmyay, +++: +Maqdag.- 


In the language of coins, this means that we can obtain the amount n 
using the coins a1, d@2,...,@qa. The Frobenius problem (often called the linear 
Diophantine problem of Frobenius) asks us to find the largest integer that is 
not representable. We call this largest integer the Frobenius number and 
denote it by g(a1,...,@a). The following theorem gives us a pretty formula 
for d= 2. 


Theorem 1.2. [fa and ag are relatively prime positive integers, then 
g (a1, a2) = 4€1a2 — a, — a9. 


This simple-looking formula for g inspired a great deal of research into 
formulas for g (a1, a@2,...,@a) with only limited success; see the notes at the 
end of this chapter. For d = 2, Sylvester gave the following result. 


Theorem 1.3 (Sylvester’s theorem). Let a, and az be relatively prime 
positive integers. Exactly half of the integers between 1 and (a; — 1) (az — 1) 
are representable by {a1, a2}. 


Our goal in this chapter is to prove these two theorems (and a little more) 
using the machinery of partial fractions. We approach the Frobenius problem 
through the study of the restricted partition function 


pa(n) = #{(m1,...,ma) € Z*: all m, > 0, mya, +++++maag =n}, 


2 For more information about Frobenius, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Frobenius.html. 


3 For more information about Sylvester, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Sylvester.html. 
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the number of partitions of n using only the elements of A as parts.* In view 
of this partition function, g(ai,...,@qa) is the largest positive integer n for 
which pa(n) = 0. 

There is a beautiful geometric interpretation of the restricted partition 
function. The geometric description begins with the set 


P= f (igs254) ER?: all xr; >0, £10, +-++++¢qaq = 1}. (1.4) 
The nt® dilate of a set S C R?@ is 
{(na1,nx2,...,NXa): (@1,...,%a) € S}. 


The function pa(n) counts precisely those integer points in Z? that lie in the 
n‘) integer dilate of the body P. The set Z% is an example of a lattice,? and 
so integer points are often called lattice points. The dilation process in this 
context is tantamount to replacing x1;a; +-:-+2qaq = 1 in the definition of 
P by x14, +--+: +2aaq = n. The set P turns out to be a polytope. We can 
easily picture P and its dilates for dimension d < 3; Figure 1.1 shows the 
3-dimensional case. 


Fig. 1.1 The polytope P 
for d= 3. 


4 A partition of a positive integer n is a multiset (i.e., a set in which we allow repetition) 
{n1,n2,...,n~} of positive integers such that n = ni + ng +--+: + ng. The numbers 
n1,N2,...,N~ are called the parts of the partition. 


5 A lattice is a discrete subgroup of R®, where d is a positive integer. 
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1.3 Partial Fractions and a Surprising Formula 


We first concentrate on the case d = 2 and study 
Prapy(n) = #{(k,1) €Z?: k,l >0, ak+bl=n}. 


Recall that we require a and b to be relatively prime. To begin our discussion, 
we play around with generating functions. Consider the following product of 
two geometric series: 


1 1 
(5) (ais) “error entestemeny 


(see Exercise 1.2). If we multiply out all the terms, we obtain a power series all 
of whose exponents are linear combinations of a and b. In fact, the coefficient 
of z” in this power series counts the number of ways that n can be written as 
a nonnegative linear combination of a and b. In other words, these coefficients 
are precisely evaluations of our counting function prqp}: 


(; “) (3) = eta" =) praay(n) 2 


k>0 1>0 n>0 


So this function is the generating function for the sequence of integers 
(Dfa,p4(n n)) . The idea is now to study the compact function on the left. 

We would. like to uncover an interesting formula for prq4;(m) by looking at 
the generating function on the left more closely. To make our computational 
life easier, we study the constant term of a related series; namely, psq,p}(7) is 
the constant term of 


ff) = Saree = > 7Pta,oy (k) a 


k>0 


The latter series is not quite a power series, since it includes terms with negative 
exponents. These series are called Laurent series, after Pierre Alphonse Laurent 
(1813-1854). For a power series (centered at 0), we could simply evaluate the 
corresponding function at z = 0 to obtain the constant term; once we have 
negative exponents, such an evaluation is no longer possible. However, if we 
first subtract all terms with negative exponents, we obtain a power series 
whose constant term (which remains unchanged) can now be computed by 
evaluating this remaining function at z = 0. 

To be able to compute this constant term, we will expand f into partial 
fractions. As a warmup to partial fraction decompositions, we first work out 
a 1-dimensional example. Let’s denote the a*” root of unity e27'/¢ by &q: 


€q = 7/4 = cos 4 at isin 22 ; 
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then the collection of all a® roots of unity comprises 1, €,,€?,€3,...,€271. 


Example 1.4. Let’s find the partial fraction expansion of —. The poles of 


this function are located at all at® roots of unity €* for k =0,1,...,a—1. So 
we expand 
1— z — gk 
To find the coefficients C;,, we proceed as follows: 
1 1 K 
Cy = li —& = = —74 
: eek (2~&) (; =) aoek —aze-1 a’ 


where we have used L’Hopital’s rule in the penultimate equality. Therefore, 
we arrive at the expansion 


biel 
l-24 a 44 z—éko 


k=0 


Returning to restricted partitions, the poles of f are located at z = 0 with 
multiplicity n, at z = 1 with multiplicity 2, and at all the other a* and b*® 
roots of unity with multiplicity 1, because a and b are relatively prime. Hence 
our partial fraction expansion looks like this: 


= Al | Aa An, By | By Ck ne Dd; 
f(z) = a 2 i ey yn al ea ee —@° (1.5) 


We invite the reader to compute the coefficients (Exercise 1.21) 


iL 
Ck = ; (1.6) 
a (1 — Ebb) Eger") 


1 
b(1 = qe) pel) 


To compute Bz, we multiply both sides of (1.5) by (z — 1)? and take the limit 
as z — 1 to obtain 


D;= 


. (z—1)? 1 
By =1 = 
2 1 (1 —24)(1— 2) 2" ad’ 


by applying L’Hopital’s rule twice, for example. For the more interesting 
constant By, we compute 
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again by applying L’Hopital’s rule. 

We do not need to compute the coefficients A;,..., An, since they contribute 
only to the terms with negative exponents, which we can safely neglect, because 
those terms are different from the constant term of f. Once we have the other 
coefficients, the constant term of the Laurent series of f is—as we said 
above—the following function evaluated at 0: 


a-1 


By Bo G. D; 
oe ae 
eo IP a he eS z=0 
b-1 
Cr D; 
=a +B.-F 4-52 
j=1 Sb 
With (1.6) in hand, this simplifies to 
a—1 6-1 
1 1 n 1 1 1 1 
= —— (LT 
Pra) = 95% ot ab a » (le jee Xa a ee i 


Encouraged by this initial success, we now proceed to analyze each sum in 
(1.7) with the hope of recognizing them as more familiar objects. 

For the next step, we need to define the greatest-integer function ||, 
which denotes the greatest integer less than or equal to z. A close sibling 
to this function is the fractional-part function {x} = x — ||. To readers 
not familiar with the functions |x| and {a} we recommend working through 
Exercises 1.3-1.5. 

What we will do next is to study a special case, namely b = 1. This is 
appealing, because ps_,1}(n) simply counts integer points in an interval: 


P{ar}(n) = # {(k, 1) eZ: k,l > 0, ak+l=n} 
=#{keZ: k>0, ak<n} 


=#{keZ: 0<k<"} 


(See Exercise 1.3.) On the other hand, in (1.7), we just computed a different 
expression for this function, so that 


1 to 
2a oe =H ékn = P{a,i}(n) = =| +1. 


k=1 


With the help of the fractional-part function {~%} = x — |a]|, we have derived 
a formula for the following sum over the nontrivial a‘® roots of unity: 
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Tc. 1 . : ; 
ay CER lal 5 og! (1.8) 


We are almost there: we invite the reader (Exercise 1.22) to show that 


= 1 1 a-1 1 
ae (1 — €bk) gkn = oe (1 — €F) gb-tkn ; (1.9) 


where b~! is an integer such that b~'b = 1 mod a, and to conclude that 


1 1 b-!n 1 1 
2 Tae = { : bas Ya oa, 


Now all that is left to do is to substitute this expression back into (1.7), which 
yields the following beautiful formula due to Peter Barlow (1776-1862)° and 
(in the form we state here) Tiberiu Popoviciu (1906-1975). 


Theorem 1.5 (Barlow—Popoviciu formula). Jf a and b are relatively 


prime, then 
(n) n bon a—'n i 
n)= 
P{a,b} ab A b , 


where b~'b =1 mod a and a~'a =1 mod b. 


si p{a,7} (2) 


15 


10 


Fig. 1.2 The graph of py4.7}(n). 


8 For more information about Barlow, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Barlow.html. 
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We shall have more to say about the arithmetic nature of restricted partition 
functions in general; for now, let’s note that the function p,q_.,}(n) behaves 
roughly linearly, with a periodic part that stems from the fractional-part 
functions. Figure 1.2 shows the behavior of the function for a = 4, b = 7. 

Theorem 1.5 can be neatly and easily rewritten using greatest-integer 
functions, as follows (Exercise 1.28): 


Corollary 1.6. Given relatively prime positive integers a and b, let p and q 
be positive integers such that aq — bp = 1. Then 


Pfap}(N) = He oa [=| ifatn, 


|[e|-P2+1 ifaln. 


1.4 Sylvester’s Result 


Before we apply Theorem 1.5 to obtain the classical Theorems 1.2 and 1.3, we 
return for a moment to the geometry behind the restricted partition function 
P{a,b}(7). In the 2-dimensional case (which is the setting of Theorem 1.5), we 
are counting integer points (x,y) € Z? on the line segments defined by the 
constraints 

ax + by =n, r,y>Oo. 


Thus this line segment connects the points (0, 7) and (",0); we denote this 
line segment by [(0, #), (%,0)]. (More generally, we define the closed line 
segment joining the points x and y € R®@ as follows: 


x,y] = {Ax+(1-A)y: 0<SA< J}, 


and we will use the usual similar notation to denote an open line segment 
(x,y) and a half-open line segment (x, y].) 

As n increases, the line segment [(0, 7), (%,0)] becomes dilated. It is not 
too far-fetched (although Exercise 1.13 teaches us to be careful with such 
statements) to expect that the likelihood for an integer point to lie on the 
line segment increases with n. In fact, one might even guess that the number 
of points on the line segment increases linearly with n, since the line segment 
is a 1-dimensional object. Theorem 1.5 quantifies the previous statement in 
a very precise form: pyq,4}(7) has the “leading term” 77, and the remaining 
terms are bounded as functions of n. Figure 1.3 shows the geometry behind 
the counting function pr4,7}(n) for the first few values of n. Note that the 
thick line segment for n = 17 = 4: 7—4-—7 is the last one that does not 


contain any integer point. 


Lemma 1.7. Ifa and b are relatively prime positive integers and n € [1, ab—1] 
is not a multiple of a or b, then 


1.4 Sylvester’s Result 13 


y 

3 io) fo) e fo) e 
2 9) fo) e fo) io) 
1 co) fo) e 


Fig. 1.3 The nonnegative integer solutions to 4a + 7y = n for n = 0,1,2,.... 


P{a,b}(%) + Pfa,o} (ab —n) = 1. 


In other words, for n between 1 and ab—1 and not divisible by a or b, exactly 
one of the two integers n and ab —n is representable in terms of a and b. 


Proof. This identity follows directly from Theorem 1.5: 


east n) = = {Pia foro 4, 


ab 


Here, (x) follows from the fact that {—x} = 1— {x} if « ¢ Z (see Exercise 1.5). 


Proof of Theorem 1.2. We have to show that p,_,}(ab — a — b) = 0 and that 
P{ap}(n) > 0 for every n > ab—a— b. The first assertion follows from 
Exercise 1.24, which states that pya4}(a + b) = 1, and Lemma 1.7. To prove 
the second assertion, we note that { mh <1- + for every integer m. Hence 
for every positive integer n, 


ab—a—b+n 1 1 n 
b b4 > 1 1 fl= 4 
iad} 8 n)2 ab ( “) ( ;) ab vo 
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Proof of Theorem 1.8. Recall that Lemma 1.7 states that for n between 1 and 
ab—1 and not divisible by a or b, exactly one of n and ab— 7 is representable. 
There are 

ab—a—b+1=(a—1)(b—-1) 


integers between 1 and ab—1 that are not divisible by a or b. Finally, we note 
that pa,o}(n) > O ifn is a multiple of a or b, by the very definition of pyq4}(n). 
Hence the number of nonrepresentable integers is (a — 1)(b— 1). 


Note that we have proved even more. Essentially by Lemma 1.7, every 
positive integer less than ab has at most one representation. Hence, the 
representable integers less than ab are uniquely representable (see also Exer- 
cise 1.25). 


1.5 Three and More Coins 


What happens to the complexity of the Frobenius problem if we have more 
than two coins? Let’s return to our restricted partition function 


pa(n) =#{(mi,...,ma) € Z*: all m; > 0, mya, +++: +maaa =n}, 


where A = {a1,...,aq}. By the very same reasoning as in Section 1.3, we can 
easily write down the generating function for p(n): 


2, Paln)2" = (4) i. 7 es 


We use the same methods that were exploited in Section 1.3 to recover our 
function pa(n) as the constant term of a useful generating function. Namely, 


1 
(1- gar) (1— a=) : 


We now expand the function on the right into partial fractions. For reasons of 
simplicity, we assume in the following that a,,...,aq are pairwise relatively 
prime; that is, no two of the integers a,,a2,...,a@q have a common factor. 
Then our partial fraction expansion looks like this: 


pa(n) = const ( 


1 
(1 — 2@1)--- (1 — 244) 2” 
Aj Ag An By, By Ba 


Wa tet ta tit Gap tt Gee OM) 


a,—1 ag—1 ag—1 
Cir Cor Cak 
sD Dicey a ae ye oP eae 


ie)= 
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By now, we are experienced in computing partial fraction coefficients, so that 
the reader will easily verify that (Exercise 1.31) 


1 
ay (1 = -) (1 = -) duns (1 _ ioe) ame . 


As before, we do not have to compute the coefficients A,,..., An, because 
they do not contribute to the constant term of f. For the computation of 
By,,...,Ba, we may use a symbolic manipulation program such as Maple, 
Mathematica, or Sage. Again, once we have calculated these coefficients, we 
can compute the constant term of f by dropping all negative exponents and 
evaluating the remaining function at 0: 


a,—-1 ag—1 
_f Bb _ Ba Cie Cak 
pats) = (P54 : @-18t 63-8 : 5 | 


Ci = (1.12) 


k=1 z=0 
Ga) ey ait es 
= —B,+By—----+(-1)*Ba- So Se - - OS. 
k=1 741 k=1 >4d 


Substituting the expression we found for Cj, into the latter sum over the 
nontrivial a} roots of unity, for example, gives rise to 


a,—1 


1 sy 1 
ns (=a) ah) 0a 


This motivates the definition of the Fourier-Dedekind sum 


kn 


yoo} 
if (Ciytajase yt b) b « (1a 
Ss (ay a2 a ) b » @- -) (= ) mn (- Pe) ( ) 


We will study these sums in detail in Chapter 8. With this definition, we have 
arrived at the following result. 


Theorem 1.8. The restricted partition function for A = {a,,a2,...,daa}, 
where the ax’s are pairwise relatively prime, can be computed as 


pa(n) = By t Bg vee ( 1)*Ba + s—n (a2, 43,...,@d; 41) 


+ Sn (a1, a3, a4, ace . Aq; 42) eae (a1, a2, Shiba jd 130d) é 


Here By, Bo,..., Ba are the partial fraction coefficients in the expansion (1.11). 


Example 1.9. We give the restricted partition functions for d = 3 and 4. 
These closed-form formulas have proven useful in the refined analysis of the 
periodicity that is inherent in the restricted partition function p(n). For 
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example, one can visualize the graph of piq.p,-}(m) as a “wavy parabola,” as its 
formula plainly shows (Figure 1.4 gives an example, the case A = {4,7, 15}): 


Ea 1,11 2 i ae ee 
Pla") = abe 2 ab ac be 12 bc be ac. ab 


a-1l 


1 1 
4 Da (1 — Eh) 7 — Eke) Ekn a b arr — Eke) (1 — eho) ebm 


: 1 
oC d-ae 


_4 dd Ly ee ee 
‘ab ac. be 8la'b cd 


i= 1 
ar ua “ (E— ER) (1 — ER) — ERA En 


a—1 


b 


1 1 
Oh 0 0 Pe 


i= 1 
c > (Leh) (1684) (1 = eee 
d-1 


1 1 
a Te) PY 


Notes 


1. The theory of generating functions has a long and powerful tradition. We 
only touch on its utility. For those readers who would like to dig a little 
deeper into the vast generating-function garden, we strongly recommend 
Herb Wilf’s generatingfunctionology [252] and Laészlé Lovasz’s Combinatorial 
Problems and Exercises [165]. The reader might wonder why we do not stress 
convergence aspects of the generating functions we play with. Almost all of 
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Fig. 1.4 The graph of py47,15}(n). 


our series are geometric series and have trivial convergence properties. In the 
spirit of not muddying the waters of lucid mathematical exposition, we omit 
such convergence details. 


2. The Frobenius problem is named after Georg Frobenius, who apparently 
liked to raise this problem in his lectures [60]. Theorem 1.2 is one of the famous 
folklore results and might be one of the most misquoted theorems in all of 
mathematics. People usually cite James J. Sylvester’s problem in [239], but 
his paper contains Theorem 1.3 rather than Theorem 1.2. In fact, Sylvester’s 
problem had previously appeared as a theorem in [238]. It is not known who 
first discovered or proved Theorem 1.2. It is very conceivable that Sylvester 
knew about it when he came up with Theorem 1.3. 


3. The linear Diophantine problem of Frobenius should not be confused with 
the postage-stamp problem. The latter problem asks for a similar determination, 
but adds an additional independent bound on the size of the integer solutions 
to the linear equation. 


A. Theorem 1.5 has a fascinating history. The earliest appearance of this result 
(in the form of Corollary 1.6) that we are aware of is in a book on elementary 
number theory, from 1811, by Peter Barlow [18, p. 323-325]. The version we 
state in Theorem 1.5 (which is our first example of a quasipolynomial, a term 
we will see frequently in the coming chapters) seems to go back to a paper 
by Tiberiu Popoviciu [195]. The Barlow—Popoviciu formula has since been 
resurrected at least twice [215,245]. 
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Guglielmo Libri (1803-1869),” a mathematical prodigy, wrote down a 
version of the Barlow—Popoviciu formula in terms of trigonometric functions. 
His exceedingly colorful life is perhaps best summed up in the following 
quotataion from [205]: 


Admirable in the salons and incomparably friendly, flexible, with gentle epigrams of 
sweet humour, elegant flattery, a good writer in both French and Italian, a profound 
mathematician, geometer, physicist, knowing history through and through, a very 
analytic and comparative mind ... more expert than an auctioneer or a bookseller 
in the science of books, this man had only one misfortune: he was essentially a thief. 


5. Fourier-Dedekind sums first surfaced implicitly in Sylvester’s work (see, 
e.g., [237]) and explicitly in connection with restricted partition functions 
in [143]. They were rediscovered in [37], in connection with the Frobenius 
problem. The papers [113,208] contain interesting connections to Bernoulli 
and Euler polynomials. We will resume the study of Fourier-Dedekind sums 
in Chapter 8. 


6. As we mentioned above, the Frobenius problem for d > 3 is much harder 
than the case d = 2 that we have discussed. Certainly beyond d = 3, the 
Frobenius problem is wide open, though much effort has been put into its 
study. The literature on the Frobenius problem is vast, and there is still much 
room for improvement. The interested reader might consult the comprehensive 
monograph [201], which surveys the references to almost all articles dealing 
with the Frobenius problem and gives about 40 open problems and conjectures 
related to the Frobenius problem. To give a flavor, we mention two landmark 
results that go beyond d = 2. 

The first one concerns the generating function r(z) := )o,cp 2", where 
R is the set of all integers representable by a given set of relatively prime 
positive integers a1, d2,...,@q- It is not hard to see (Exercise 1.36) that r(z) = 
p(z)/ (1 — 2%) (1 = 2%)+--(1 — 2%) for some polynomial p. This rational 
generating function contains all the information about the Frobenius problem; 
for example, the Frobenius number is the total degree of the rational function 
i= —1(2). Hence the Frobenius problem reduces to finding the polynomial 
p, the numerator of r. Marcel Morales [179,180] and Graham Denham [102] 
discovered the remarkable fact that for d = 3, the polynomial p has either four 
or six terms. Moreover, they gave semiexplicit formulas for p. The Morales— 
Denham theorem implies that the Frobenius number in the case d = 3 is 
quickly computable, a result that is originally due, in various disguises, to 
Jiirgen Herzog [132], Harold Greenberg [124], and J. Leslie Davison [93]. As 
much as there seems to be a well-defined border between the cases d = 2 
and d = 3, there also seems to be such a border between the cases d = 3 
and d = 4: Henrik Bresinsky [64] proved that for d > 4, there is no absolute 


7 For more information about Libri, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Libri.html. 
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bound on the number of terms in the numerator p, in sharp contrast to the 
Morales—Denham theorem. 

On the other hand, Alexander Barvinok and Kevin Woods [23] proved that 
for fixed d, the rational generating function r(z) can be written as a “short” 
sum of rational functions; in particular, r can be efficiently computed when d 
is fixed. A corollary of this fact is that the Frobenius number can be efficiently 
computed when d is fixed; this theorem is due to Ravi Kannan [144]. On the 
other hand, Jorge Ramirez-Alfonsin [200] proved that trying to compute the 
Frobenius number efficiently is hopeless if d is left as a variable. 

While the above results settle the theoretical complexity of the computation 
of the Frobenius number, practical algorithms are a completely different 
matter. Both Kannan’s and Barvinok—Woods’s ideas seem complex enough 
that nobody has yet tried to implement them. The currently most competitive 
algorithms include [49, 59, 207]. 


Exercises 


1.1. & Check the partial fraction expansion (1.2): 


z 1/v5 1/V5 


i=2527" ~4 1+V5 » 1 iV » 


1.2. & Suppose z is a complex number, and n a positive integer. Show that 
(l—z)(lt+2+274+:---+2") = ]-27+1, 


and use this to prove that if |z| < 1, then 


> 


k>0 


1.3. & Find a formula for the number of lattice points in [a,b] for arbitrary 
real numbers a and b. 


1.4. Prove the following. Unless stated differently, n € Z and x,y € R. 


(a) |e+n] = |e] +n. 
(b) [ze] + ly) Slet+y) Sle] +lyJ +1 
0 ifz eZ, 
ls al {2 otherwise. 
(d) For n € Zso, | =| =|4|. 
(e) — a x| is the least integer greater than or equal to x, denoted by [«]. 
(f) |a+ 4] is the nearest integer to x (and if two integers are equally close 
to x, it is the larger of the two). 
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(g) |e) +|a+3] = [22]. 

(h) If m and n are positive integers, || is the number of integers among 
1,...,m that are divisible by n. 

(i) & If m € Zyo, n€ Z, then | == | = [=| Li, 


(j) & If m € Zyo, n € Z, then |"—| + 1 is the least integer greater than or 
equal to 7. 


1.5. Rewrite in terms of the fractional-part function as many of the above 
identities as you can make sense of. 


1.6. Suppose m and n are relatively prime positive integers. Prove that 


5 |e) | = Lom yom. 


k=0 j=0 


1.7. Prove the following identities. They will come in handy at least twice: 
when we study partial fractions, and when we discuss finite Fourier series. For 
o,W ER, n€ Zyo, mE Z, 


(ajer= I 


(e) P= 1 
f |e**| = 
(8) gee =e 
n-1 . 
(h) e2rikm/n _ i ifn | Mh 
— 0 otherwise 
n—-1 
‘ Qrik/n _ n 
ie tae ~ Gani/n 1° 


1.8. Suppose m,n € Z and n > 0. Find a closed form for 
n—1 n—-1 

k 2Qrikm/n k Qrikm/n 
ot ee SEL 
k=0 k=0 
(as a function of m and n). 


1.9. & Suppose m and n are relatively prime integers, and n is positive. Show 
that 
Cal :O0<k< n} = oe :0<j< n} 


and 
ore -0<ke n} = soe : 0<j<nbh. 
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Conclude that if f is any complex-valued function, then 


n-1 n—1 
nmimk/n\ _ nij/n 
df G ) » f (c? J ) 


and a os 
» f (or _ yp f fern) 


1.10. Suppose n is a positive integer. If you know what a group is, prove 
that the set Oe :0<k< n} forms a cyclic group of order n (under 
multiplication in C). 


1.11. Fix n € Zyo. For an integer m, let (m mod n) denote the least nonneg- 
ative integer in G; := Z, to which m is congruent. Let’s denote by * addition 
modulo n, and by o the following composition: 


(ep 


defined on the set Gg := { {=} :me Z}. Define the following functions: 


Qrim/n 
7 


@((m mod n)) = e 
J tile es 3 5 
xX ({=}) = (m mod n). 


Prove the following: 


@((m, mod n) * (mz mod n)) = ¢((m1 mod n)) ¢((mz mod n)) , 
w Cas — w (ern 6 w Gu ; 
my m2 my me 
EG ao aa) 

n n n n 
Prove that the three maps defined above, namely ¢, w, and x, are one-to-one. 
Again, for the reader who is familiar with the notion of a group, let G3 be the 
group of n* roots of unity. What we have shown is that the three groups G1, 


Ge, and G3 are all isomorphic. It is very useful to cycle among these three 
isomorphic groups. 


1.12. & Given integers a,b,c,d, form the line segment [(a,b), (c,d)] C R? 
joining the points (a,b) and (c,d). Show that the number of integer points on 
this line segment is ged(a — c,b — d) + 1. 


1.13. Give an example of a line with 
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(a) no lattice points; 
(b) one lattice point; 
(c) an infinite number of lattice points. 


In each case, state—if appropriate—necessary conditions about the (ir)rationa- 
lity of the slope. 


1.14. Suppose a line y = ma +b passes through the lattice points (p,,q,) and 
(p2, q2). Prove that it also passes through the lattice points 


(pi + k(p2 — pi), + k(q2 -)), REZ. 


1.15. Given positive irrational numbers p and q with . + ; = 1, show that 


Zso is the disjoint union of the two integer sequences {|pn] : n € Zso} and 
{|qn] : n € Zyo}. This theorem from 1894 is due to Lord Rayleigh and was 
rediscovered in 1926 by Sam Beatty. Sequences of the form {|pn]| : n € Zso} 
are often called Beatty sequences. 


1.16. Let a,b,c,d € Z. We say that {(a,b) , (c,d)} is a lattice basis of Z? if 
every lattice point (m,n) € Z? can be written as 


(m,n) = p(a,b) +4q (c, d) 


for some p,q € Z. Prove that if {(a,b), (c,d)} and {(e, f),(g, h)} are lattice 
bases of Z?, then there exists an integer matrix M with determinant +1 such 


that ; f 
a _[{e 
oG: 


Conclude that the determinant of (: 1 is J 


1. 


1.17. & Prove that a triangle with vertices on the integer lattice has no other 
interior /boundary lattice points if and only if it has area $. (Hint: You may 
begin by “doubling” the triangle to form a parallelogram.) 


1.18. Let’s define a northeast lattice path as a path through lattice points 
that uses only the steps (1,0) and (0,1). Let ZL, be the line defined by 
x +2y =n. Prove that the number of northeast lattice paths from the origin 
to a lattice point on L,, is the (n + 1)'® Fibonacci number fy+1. 


1.19. Compute the coefficients of the Taylor series of ao? expanded at 
2:= 0 


(a) by a counting argument, 
(b) by differentiating the geometric series. 


Generalize. 


1.20. & Prove that if a,,a2,...,a@q € Zyg do not have a common factor, then 
the Frobenius number g(a1,...,@q) is well defined. 
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1.21. & Compute the partial fraction coefficients (1.6). 
1.22. & Prove (1.9): for relatively prime positive integers a and b, 


{c= a-1 
1 


1 
or ae 0 0-heEe 


where b-'b = 1 mod a, and deduce from this (1.10), namely, 


A 1 _ fay ed 4 
aan (1 — gk) gkn a "2 Ba" 


(Hint: Use Exercise 1.9.) 


1.23. Prove that for relatively prime positive integers a and 8, 


Pfa,b}(m + ab) = prap}(n) +1. 


1.24. & Show that if a and b are relatively prime integers > 2, then 


PLa,b} (a + b) = ll. 


1.25. To extend the Frobenius problem, we say that an integer n is k- 
representable if p4(n) = k; that is, n can be represented in exactly k 
ways using the integers in the set A. Define gx = gx(a1,...,@a) to be the 
largest k-representable integer. Prove: 


(a) Let d = 2. For every k € Zso, there is an N such that all integers larger 
than N have at least k representations (and hence g;,(a, b) is well defined). 

(b) gx (a,b) = (K+ 1)ab—a—b. 

(c) Given k > 2, the smallest k-representable integer is ab(k — 1). 

(d) The smallest interval containing all uniquely representable integers is 
[min(a, b), gi(a, b)]. 

(e) Given k > 2, the smallest interval containing all k-representable integers 
is [9—2(a, b) rat b, gr (a, b)). 

(f) There are exactly ab — 1 integers that are uniquely representable. Given 
k > 2, there are exactly ab k-representable integers. 

(g) Extend all of this to d > 3 (see Open Problem 1.43). 


1.26. Find a formula for p,q} (n). 

1.27. Prove the following recurrence formula for n € Zo: 
lag 
m=0 


Use it in the case d = 2 to give an alternative proof of Theorem 1.2. 
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1.28. Prove the equivalence of Theorem 1.5 and Corollary 1.6. 


1.29. Give an alternative proof of Corollary 1.6 (i.e., one that does not use 
partial fraction expansions of generating functions). 


1.30. Prove the following extension of Theorem 1.5: Suppose gcd(a, b) = d. 


Then 
ee ai onl 41 ifd 
P{a,by(m) = 4 2 { rd ime if d\n, . 
0 otherwise, 


where 8 4 = 1 mod ¢ and a4 = 1 mod e 
1.31. & Compute the partial fraction coefficient (1.12). 
1.32. Find a formula for pyq.p,-}(m) for the case gcd(a, b,c) A 1. 
1.33. & With A = {aj,a2,...,aq} C Zyo, let 
pa(n) = # {(m™m1,..., ma) €Z*: all m; >0, mia, +--+ + maaa =n}; 


that is, p(n) counts the number of partitions of the positive integer n using 
only the elements of A as parts, where each part is used at least once. Find 
formulas for p% for A = {a}, A = {a,b}, A = {a,b,c}, A = {a,b, c,d}, where 
a,b,c,d are pairwise relatively prime positive integers. Observe that in all 
examples, the counting functions p, and p% satisfy the algebraic relation 


Pa(—n) = (-1)*" pa(n). 


(What we mean here is the following: using the algebraic expression for p(n), 
substitute n by —n, even though negative arguments do not make sense in 
terms of the combinatorial definition of p4(n). Then compare this formula 
with the algebraic expression for (—1)4~! p4(n).) 


1.34. Prove that p(n) = pa (n — a1 — ag — +++ — ag) for n > ay+ag4+-+-+aq. 
(Here, as usual, A = {a1,a2,...,@q}.) Conclude that in the examples of 
Exercise 1.33, the algebraic relation 


pa(—n) = (-1)7"' pa (n — ay — ag — +++ — a) 
holds. 


1.35. For relatively prime positive integers a, b, let 
R:={am-+bn: m,n€ Zso}, 
the set of all integers representable by a and 6. Prove that 


Sz = 1— 2% 
~ (l—24)(1- 2)" 


keR 
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Use this rational generating function to give alternative proofs of Theorems 1.2 
and 1.3. 


1.36. For relatively prime positive integers a1, a2,...,d@a, let 
R:= {mya, + mgaq +--+ + mada: M1,Me2,...,Ma € Zso}, 
the set of all integers representable by a1, a2,...,aq. Prove that 


r(z):= ye zk — p(2) 


— 1 — 2%) (1 — z@)--- (1 — 244) 
for some polynomial p. 


1.37. Prove Theorem 1.1: Given a rational function Neancaeas where p is 
k=1 
a polynomial of degree less than e; + e2 +--+: + em and the a,x are distinct, 


there exists a decomposition 


m 
S- Chl, Ck,2 Chen 
T ea | 7" 
z—Ak (z—ax)° (z — ax) ° 


k=1 


where the cy,; € C are unique. 

Here is an outline of one possible proof. Recall that the set of polynomials 
(over R or C) forms a Euclidean domain, that is, given two polynomials 
a(z), b(z), there exist polynomials q(z),r(z) with deg(r) < deg(b) such that 


a(z) = b(z) q(z) + r(z). 


Applying this procedure repeatedly (the Euclidean algorithm) gives the great- 

est common divisor of a(z) and b(z) as a linear combination of a(z) and 6(z), 

that is, there exist polynomials c(z) and d(z) such that a(z)c(z) + b(z)d(z) = 

gcd (a(z), b(z)). 

Step 1: Apply the Euclidean algorithm to show that there exist polynomials 
uz, Ug such that 


ur(z) (z — a1)*! + ue(z) (2 — ag)? = 1. 


Step 2: Deduce that there exist polynomials v1, v2 with deg (vg) < ex such 
that 
(2) _ vile), ale) 
(z — a1)“ (z — a2)” (z— a) (z — a2)? * 


(Hint: Long division.) 
Step 3: Repeat this procedure to obtain a partial fraction decomposition for 
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Open Problems 


1.38. Come up with a new approach or a new algorithm for the Frobenius 
problem in the d = 4 case. 


1.39. There are a very good lower [93] and several upper bounds [201, Chap- 
ter 3] for the Frobenius number. Come up with improved upper bounds. 


1.40. Solve Vladimir I. Arnold’s Problems 1999-2008 through 1999-2011 [11]. 
To give a flavor, we mention two of the problems explicitly: 


(a) Explore the statistics of g (a,,a2,...,@q) for typical large a1, a2,..., aq. It 
is conjectured that g (a1, a2,...,a@a) grows asymptotically like a constant 
times 4—\/a1a2--- aq. (See [4,5, 115].) 

(b) Determine what fraction of the integers in the interval [0, g (a1, a2,..., @a)] 
are representable, for typical large a1, a2,...,@q. It is conjectured that 
this fraction is asymptotically equal to 4 (Theorem 1.3 implies that this 
conjecture is true in the case d = 2.) 


1.41. Study vector generalizations of the Frobenius problem [203, 221]. 


1.42. There are several special cases of A = {a1,a2,...,@a} for which the 
Frobenius problem is solved, for example arithmetic sequences [201, Chapter 3]; 
see also [216, 246, 247]. Study these special cases in light of the generating 
function r(a), defined in the notes and in Exercise 1.36. 


1.43. Study the generalized Frobenius number g;, (defined in Exercise 1.25; 
see also [41,44,217]), e.g., in light of the Morales-Denham theorem mentioned 
in the notes. Derive formulas for special cases, e.g., arithmetic sequences. 


1.44. For which 0 < n < b—1 is Sp (a1, @2,...,aa;b) = 0? (See also Open 
Problem 8.24.) 


Chapter 2 
A Gallery of Discrete Volumes 


Few things are harder to put up with than a good example. 


Mark Twain (1835-1910) 


A unifying theme of this book is the study of the number of integer points in 
polytopes, where the polytopes live in a real Euclidean space R¢. The integer 
points Z? form a lattice in R?, and we often call the integer points lattice 
points. This chapter carries us through concrete instances of lattice-point 
enumeration in various integral and rational polytopes. There is a tremendous 
amount of research taking place along these lines, even as the reader is looking 
at these pages. 


2.1 The Language of Polytopes 


A polytope in dimension 1 is a closed interval; the number of integer points 
in [¢, Al is easily seen to be | S| - [2+ | (Exercise 2.1; here we assume that 
a,b,c,d € Z with ¢ < $). A 2-dimensional convex polytope is a convex 
polygon: a compact convex subset of R? bounded by a simple closed curve 
that is made up of finitely many line segments. 

In general dimension d, a convex polytope is the convex hull of finitely 
many points in R%. To be precise, for a finite point set {v1,v2,...,vn} C R%, 
the polytope P is the smallest convex set containing those points; that is, 


P= Oud devo bso Ay? aS ond Dy hp, = 1), 


This definition is called the vertex description of P, and we use the notation 


P = conv {v1,V2,.--,Vn}, 
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the convex hull of v1, v2,...,V,. In particular, a polytope is a closed subset of 
R?. Many polytopes that we will study, however, are not defined in this way, but 
rather as bounded intersections of finitely many half-spaces and hyperplanes. 
One example is the polytope P defined by (1.4) in Chapter 1. (A set, bounded 
or not, that can be described as the intersection of finitely many half-spaces 
and hyperplanes is a polyhedron.) This hyperplane description of a 
polytope is, in fact, equivalent to the vertex description. The fact that every 
polytope has both a vertex and a hyperplane description is highly nontrivial, 
both algorithmically and conceptually. We carefully work out a proof in 
Appendix A. 
The dimension of a polytope P is the dimension of the affine space 


spanP := {x + \(y—x): xy EP,AER} 


spanned by P. If P has dimension d, we use the notation dim P = d and call 
P a d-polytope. Note that P C R@ does not necessarily have dimension d. For 
example, the polytope P defined by (1.4) has dimension d — 1. 

For a convex polytope P C R%, we say that the hyperplane H = 
{x €R¢:a-x= b} is a supporting hyperplane of P if P lies entirely 
on one side of H, that is, 


Pc{xeRt:a-x<b} or Pc{xeRt:a-x>bd}. 


A face of P is a set of the form PM H, where H is a supporting hyperplane 
of P. Note that P itself is a face of P, corresponding to the degenerate 
hyperplane R?,! and the empty set @ is a face of P, corresponding to a 
hyperplane that does not meet P. The (d — 1)-dimensional faces are called 
facets, the 1-dimensional faces edges, and the 0-dimensional faces vertices 
of P. Vertices are the “extreme points” of a polytope. 

A convex d-polytope has at least d+ 1 vertices. A convex d-polytope with 
exactly d+ 1 vertices is called a d-simplex. Every 1-dimensional convex 
polytope is a 1-simplex, namely, a line segment. The 2-dimensional simplices 
are the triangles, the 3-dimensional simplices the tetrahedra. 

A convex polytope P is called integral if all of its vertices have integer 
coordinates,? and P is called rational if all of its vertices have rational 
coordinates. 


1 In the remainder of the book, we will reserve the term hyperplane for nondegenerate 
hyperplanes, i.e., sets of the form {x €R¢:a-x= b}, where not all of the entries of a 
are zero. 


2 Integral polytopes are also called lattice polytopes. 
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2.2 The Unit Cube 


As a warmup example, we begin with the unit d-cube 0 := (0, 1)¢, which 
simultaneously offers simple geometry and an endless fountain of research 
questions. The vertex description of 0 is given by the set of 2% vertices 
{(@1,@2,...,@q): all x, =0 or 1}. The hyperplane description is 


= {(a1,22,...,%4) € R®: 0< a, <1 for all k=1,2,...,d}. 


Thus, there are the 2d bounding hyperplanes 7, = 0, 41 = 1, rz = 0, x2 
1, iia Bg = 0, tq=l1. 

We now compute the discrete volume of an integer dilate of 0. That is, we 
seek the number of integer points {OM Z® for all t € Z\o. Here tP denotes 
the dilated polytope 


{ (i201, bi2y.4.5 tg) : (iis Wie: nag a) € Py 


for a polytope P. What is the discrete volume of 0? We dilate by the positive 
integer ¢, as depicted in Figure 2.1, and count: 


# (£0 NZ*) = #([0,4°NZ*%) = (¢+1)*. 


Fig. 2.1 The 6* dilate of 
in dimension 2. 


We generally denote the lattice-point enumerator for the t*® dilate of 
P CR? by 
L(t) := #(tP NZ‘), 


a useful object that we also call the discrete volume of P. We may also 
think of leaving P fixed and shrinking the integer lattice: 


Lp(t) =# (Pr +2!) 
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With this convention, La(t) = (t +1)%, a polynomial in the integer variable t. 
Notice that the coefficients of this polynomial are the binomial coefficients 
((), defined through 


(") _ m(m=1(m = 2)-+-(m=n+1) Be 


n! 


form € C,n € Zyo. 
What about the interior O° of the cube? The number of interior integer 
points in ¢0° is 


Loe(t) = # (tO° NZ?) = # ((0,t)?7NZ4) =(t-1)%. 


Notice that this polynomial equals (—1)?Zu(—+t), the evaluation of the poly- 
nomial Lo(t) at negative integers, up to a sign. 

We now introduce another important tool for analyzing a polytope P, 
namely the generating function of Lp: 


Ehrp(z = =1+ 5° Lp(t) 


t>1 


This generating function is also called the Ehrhart series of P. 
In our case, the Ehrhart series of ? = UO takes on a special form. To 
illustrate, we define the Eulerian number A (d,k) through? 


be wi = ukeo A (d,k) 2 (2.2) 


(1 — z)¢1 
j>0 


It is not hard to see that the polynomial = A (d,k) z* is the numerator of 
the rational function 


ay? 7 1 d d (1 
“de l=s) “de “de \i=s/* 
nis eS 


d times 


The Eulerian numbers have many fascinating properties, including 


A(d,k) = A(d,d+1—k), 
A(d,k) = (d—k+1)A(d—1,k-1)+kA(d—-1,h), 


d 
S°A(d,k) = d!, (2.3) 


3 There are two slightly conflicting definitions of Eulerian ces in the literature: some- 


times, they are defined through DV y>00 b1)4z = Ele 0 so AGN instead of (2.2). 
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A(é’) = 0-1 (OV a-at 


j=0 


The first few Eulerian numbers A (d,k) for 0 < k < dare 


d=0: 1 

d=1: 01 

d=2: 011 

d=3: 014 1 

d=4: 011111 1 
d=5: 0126 66 26 1 
d=6: 0157 302 302 571 


(see also [1, Sequence A008292]). 
With this definition, we can now express the Ehrhart series of 0 in terms 
of Eulerian numbers: 


Ehro(z) =14+ $°(¢4+1)¢2' = S$ (¢41)¢2 tal Site ‘ 


t>1 t>0 t>1 
Vater 
(fa z)dtt 


To summarize, we have proved the following theorem. 


Theorem 2.1. Let 0 be the unit d-cube. 


(a) The lattice-point enumerator of O is the polynomial 


d 


Lo(t)=(t+I)4=> ee 


k=0 


(b) Its evaluation at negative integers yields the relation 


(—1)?Lo(—t) = Loo(t). 


(c) The Ehrhart series of 0 is Ehro(z) = pS) Sense 


(=z) tt 


2.3 The Standard Simplex 


The standard simplex A in dimension d is the convex hull of the d+ 1 
points e),e2,...,eq and the origin; here e, is the unit vector (0,...,1,...,0), 
with a 1 in the j™ position. Figure 2.2 shows A for d = 3. On the other hand, 
A can also be realized by its hyperplane description, namely 
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v2 


3 
Fig. 2.2 The standard 


simplex A in dimension 3. 


A= { (1,22 ...,24) ER?: 41) +209+---+2¢ <1 and all xy > 0}. 
In the case of the standard simplex, the dilate tA is now given by 
tA = { (29,4545 Hg) ER: 2) +95 4-+++ay < ¢ and all 2, eae 


To compute the discrete volume of A, we would like to use the methods 
developed in Chapter 1, but there is an extra twist. The counting func- 
tions in Chapter 1 were defined by equalities, whereas the standard simplex 
is defined by an inequality. We are trying to count all integer solutions 
(m1,m2,...,ma) € ZE, to 


My +Me+e+++Mg <t. (2.4) 


To translate this inequality in d variables into an equality in d+ 1 vari- 
ables, we introduce a slack variable ma41 € Z>o, which picks up the differ- 
ence between the right-hand and left-hand sides of (2.4). So the number of 
solutions (m1,™m,...,™Ma) € Ze to (2.4) equals the number of solutions 


(mi,me,...,maz1) € ZSG" to 
my +m +++ + ™Ma41 =. 


Now the methods of Chapter 1 apply: 
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# (tA a) Z*) = const * gm » ym2 |... a ymati | ,-t 


m1>0 m2>0 Ma+120 


ae) ws) 


In contrast with Chapter 1, we do not require a partial fraction expansion 
but simply use the binomial series 


Ly (4th) x 
q-am ai ‘7 ee) 


for d > 0. The constant-term identity (2.5) requires us to find the coefficient 
of z' in the binomial series (2.6), which is (“*'). Hence the discrete volume 
of A is given by La(t) = (7), a polynomial in the integer variable t of 
degree d. Incidentally, the coefficients of this polynomial function in t have an 


alternative life in traditional combinatorics: 


d 

1 

Pg 1)?-* stirl(d + 1,4 + 1)t* 
k 


where stirl(n, 7) is the Stirling number of the first kind (see Exercise 2.11). 
We also notice that (2.6) is, by definition, the Ehrhart series of A. 

Let’s repeat this computation for the interior A° of the standard d-simplex. 
Now we introduce a slack variable mg; > 0, so that strict inequality is forced: 


Lac(t = #{(m1,m2,...,ma) € Ze: mi tmet-:-+ma <t} 


a, m1,M2,...,Ma41) € ZO" : mi +me+-+++may =t}. 


Now 
Lae (t) = const ( ». . ( PS zm ears > ymati | -t 
0 


mi> m2>0 Ma+1>0 
= const ( 


Je) 


= const | 2¢+!~* =a) =(77') 
a} 


It is a fun exercise to prove that 


(47) _ (‘,") a7) 
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(see Exercise 2.10). We have arrived at our destination: 


Theorem 2.2. Let A be the standard d-simplex. 
(a) The lattice-point enumerator of A is the polynomial La(t) = ( 


(b) Its evaluation at negative integers yields (—1)4La(—t) = Lac(t). 
(c) The Ehrhart series of A is Ehra(z) = asaya 


2.4 The Bernoulli Polynomials as Lattice-Point 
Enumerators of Pyramids 


There is a fascinating connection between the Bernoulli polynomials and 
certain pyramids over unit cubes. The Bernoulli polynomials B;(x) are 
defined through the generating function 


FO nes y. Belz) x (2.8) 


and are ubiquitous in the study of the Riemann zeta function, among other 
objects; they are named after Jacob Bernoulli (1654—-1705).4 The Bernoulli 
polynomials will play a prominent role in Chapter 12 in the context of Euler— 
Maclaurin summation. The first few Bernoulli polynomials are 


Bo(x) =1, 
Bi(z)=2--, 
2 1 
Bo(x) =« ~B+e, 
1 
Bs(x) = a" — 52° + 52, 
1 
B = 4_9 3 ) as 
4(z#) = 2 L°e+ a an 
5 5 1 
Bs(x) = 2° st + at gr: 
5 1 1 
Be(x) = 2° — 3x° + =a 5 + 7D: 


The Bernoulli numbers are By, := By,(0) (see also [1, Sequences A000367 
& A002445]) and have the generating function 


4 For more information about Bernoulli, see 
http://www-history.mcs.st-and.ac.uk/Mathematicians/Bernoulli_Jacob. html. 
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Lemma 2.3. For integers d>1 andn> 2, 


n—1 


aie 1 — = (Ba(n) — Ba). 


Proof. We play with the generating function of —— 
n-1 
Ba(n) — Ba d ee"? — Lo 
Jo gp ge 
d>0 k=0 k=0 j>0 
= J j-1 cal 
- (Ee) 5 ie ; aan 
j20 j>1 \k=0 


Now compare coefficients on both sides. 


Consider a (d — 1)-dimensional unit cube embedded into R@ and form a 
d-dimensional pyramid by adjoining one more vertex at (0,0,...,0,1), as 
depicted in Figure 2.3. More precisely, this geometric object has the following 
hyperplane description: 


P= { (Cia, s0<4 tha) €R?: 0<21,40,...,%¢-1 <1—a2g< 1}. (2.9) 


By definition, P is contained in the unit d-cube; in fact, its vertices are a 
subset of the vertices of the d-cube. 
if 


MI vy 


l= 
Fig. 2.3 The pyramid P Lo 
in dimension 3. 


We now count lattice points in integer dilates of P. This number equals 


# {(mi,me,...,ma) € Z*: O< me <t—ma<t fork =1,2,...,d—-1}. 
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In this case, we just count the solutions to 0 < mz < t—mg < t directly: once 
we choose the integer ma (between 0 and t), we have t — mq + 1 independent 


choices for each of the integers m1, ™m2,...,™q_1. Hence 
t t+1 1 
_ d-1 _ di 
Lp(t) = Deine) =e = 5 (Balt +2)— Ba), (2.10) 
Ma= c= 


by Lemma 2.3. (Here we need to require d > 2.) This is, naturally, a polynomial 
in t. 
We now turn our attention to the number of interior lattice points in P: 


_ ad. O0< me <t—ma<t 
Lepa(t) = #{ (masma,. yma) €2 * for allk=1,2,...,d—1f° 


By a similar counting argument, 


t-1 t—2 
= 1 
Lpo(t)= S* (t—ma—1)** =) kt = q (Balt - 1) - Ba). 
mg=1 k=0 


Incidentally, the Bernoulli polynomials are known (Exercise 2.15) to have the 
symmetry 
Ba(1 — 2) = (-—1)*Ba(a). (2.11) 


This identity coupled with the fact (Exercise 2.16) that 
Ba =0 for all odd d > 3 (2.12) 


gives the relation 


Next we compute the Ehrhart series of P. We can actually do this in 
somewhat greater generality. Namely, for a polytope Q C R¢~! with vertices 
V1,V2,---;Vm, define Pyr(Q), the pyramid over Q, as the convex hull of 
(v1,0),(v2,0),...,(Vm,0),(0,...,0,1) in R¢. In our example above, the d- 
polytope P is equal to Pyr(Q) for the unit (d — 1)-cube O. The number of 
integer points in t Pyr(Q) is, by construction, 


Lpyx(q)(t) = 1+ Lo(1) + Lo(2) +--+ + Lo(t) =1+ > Loli), 


because in ¢ Pyr(Q), there is one lattice point with xq-coordinate t, we have 
Lg(1) lattice points with xqg-coordinate t — 1, there are Lg(2) lattice points 
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with xq-coordinate t—2, etc., up to Lg(t) lattice points with xq = 0. Figure 2.4 
shows an instance for a pyramid over a square. 


| 
tt 


IX 


tQ 


Fig. 2.4 Counting the 


x 
lattice points in t Pyr(Q). . 


This identity for Lpy,(q)(t) allows us to compute the Ehrhart series of 
Pyr(Q) from the Ehrhart series of Q: 


Ehr 
Theorem 2.4. Ehrpy,(9)(z) = Phro(s) ; 
-—2z 


Proof. 


t 
Ehrpy.(9)(z) =1+ >_> Lpyay() 26 =14+ 50 | 14+ 0 Lely) | 
j=l 


t>1 t>1 


Our pyramid P that began this section is a pyramid over the unit (d— 1)- 
cube, and so 


d-1 k-1 d—-1 k-1 
| pala = 1,h)2 - Mani Ald—1,4)2 
Ehrp(z) = i (1-24 - G—ayei . (2.18) 


Let’s summarize what we have proved for the pyramid over the unit cube. 
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Theorem 2.5. Given d > 2, let P be the d-pyramid 
P= { (1,25 -.+ 50a) eR?: O0< %1,%2,...,%¢-1<1-ag< ie 


(a) The lattice-point enumerator of P is the polynomial 
1 
Lp(t) = d (Ba(t + 2) — Ba). 


(b) Its evaluation at negative integers yields (—1)4Lp(—t) = Lpo(t). 


d=1 — 
(c) The Ehrhart series of P is Ehrp(z) = ist Ae : . 


Patterns are emerging... 


2.5 The Lattice-Point Enumerators of the 
Cross-Polytopes 


Consider the cross-polytope © in R?% given by the hyperplane description 
Q= { (x1, £2, sand Xd) ER?: |x| + |x| se La| < 1} ‘ (2.14) 


Figure 2.5 shows the 3-dimensional instance of ©, an octahedron. The vertices 
of © are (+1,0,...,0),(0,41,0,...,0),...,(0,...,0,+1). 


Fig. 2.5 The cross- 
polytope © in dimension 3. 


To compute the discrete volume of ©, we use a process similar to that of 
Section 2.4. Namely, for a (d — 1)-polytope Q with vertices vj, V2,...,Vm 
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such that the origin is in Q, we define BiPyr(Q), the bipyramid over Q, as 
the convex hull of 


ig Vay) cians OMe Once U4) and (Oreo gale 


In our example above, the d-dimensional cross-polytope is the bipyramid 
over the (d — 1)-dimensional cross-polytope. The number of integer points in 
t BiPyr(Q) is, by construction, 


=24+2)¢ Lols) + Loelt). 


This identity allows us to compute the Ehrhart series of BiPyr(Q) from the 
Ehrhart series of Q, in a manner similar to the proof of Theorem 2.4. We 
leave the proof of the following result as Exercise 2.23. 


Theorem 2.6. If Q contains the origin, then Ehrpipyr(g)(z) = +42 Ehro(z). 


This theorem allows us to compute the Ehrhart series of © effortlessly: 
The cross-polytope © in dimension 0 is the origin, with Ehrhart series ck. 
The higher-dimensional cross-polytopes can be computed recursively through 
Theorem 2.6 as 
(1+z)¢ 


Ehro(z) = aaa: 


Since Ehro(z) = 1+ 90,5, Lo(t) z‘, we can retrieve Lo(t) by expanding 
Ehro(z) into its power series at z = 0: 


1+) area 
Ehro(z) _ A _ i = Zia We 


d d 
d\ , t+d)\ , d t—k+d\ , 
“EWP EC} LOE w 
= t>0 t>k 
d 
d t—k+d\ , 
“EWE a )* 
k=0 t>0 
In the last step, we used the fact that a) = 0 for0<t<k. But then 


1+ 5° Lo(t)z SS @ oul. 


t>1 t>0 k=0 


and hence Lo(t) = pao (t) (") for all ¢ > 1. 
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We finish this section by counting the interior lattice points in tO. We 
begin by noticing, since ¢ is an integer, that 


Loo (t) = # {(mi,m2,...,mMa) € hg : |r| || sa |ma| < t} 
= #{(mi,ma,...,ma) € Z4: [ma] + [mal +--+ + |mal < t- 1} 
Siae=1); 


On the other hand, we can use (2.7): 


Oa") 
“Ee 


er £6409 


Comparing the last two computations, we see that (—1)4Zo(—t) = Loe(t). 
Let’s summarize: 


Theorem 2.7. Let © be the cross-polytope in R®. 


(a) The lattice-point enumerator of © is the polynomial 
4 (d\ (t-k+d 

Lo(t) = ; 
Ea) 


(b) Its evaluation at negative integers yields (—1)*Lo(-t) = Loo(t). 
(c) The Ehrhart series of P is Ehro(z) = ter. 


2.6 Pick’s Theorem 


Returning to basic concepts, we now give a complete account of Lp for all 
integral convex polygons P in R?. Denote the number of integer points inside 
the polygon P by J, and the number of integer points on the boundary of P 
by B. The following result, called Pick’s theorem in honor of its discoverer 
Georg Alexander Pick (1859-1942), presents the astonishing fact that the 
area A of P can be computed simply by counting lattice points: 
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Theorem 2.8 (Pick’s theorem). For an integral convex polygon, 
1 
A=I+ 38 -1. 


Proof. We begin by proving that Pick’s identity has an additive character: 
we can decompose P into the union of two integral polygons P, and P2 by 
joining two vertices of P with a line segment, as shown in Figure 2.6. 


Fig. 2.6 Decomposition of 
a polygon into two. 


We claim that the validity of Pick’s identity for P follows from the validity 
of Pick’s identity for P, and Pz. Denote the area, number of interior lattice 
points, and number of boundary lattice points of Py by Ax, Ip, and Bz, 
respectively, for k = 1,2. Clearly, 


A=A,+ Ag. 


Furthermore, if we denote the number of lattice points on the edge common 
to P, and P2 by L, then 


IP=h+h+Ll-2 and B=B,+ Bz.—2L42. 


Thus 


1 1 1 
Pcpg = Wd dar D a Pit 5B L+1-1 


1 1 
=—1,+-B,-1+4+1 —By—-1. 
Le tat > 2 


This proves the claim. Note that our proof also shows that the validity of 
Pick’s identity for P, follows from the validity of Pick’s identity for P and Po. 

Now, every convex polygon can be decomposed into triangles that share a 
common vertex, as illustrated in Figure 2.7. Hence it suffices to prove Pick’s 
theorem for triangles. Further simplifying the picture, we can embed every 
integral triangle into an integral rectangle, as suggested by Figure 2.8. 

This reduces the proof of Pick’s theorem to proving the theorem for integral 
rectangles whose edges are parallel to the coordinate axes, and for rectangular 
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Fig. 2.7 Triangulation of 
a polygon. 


triangles two of whose edges are parallel to the coordinate axes. These two 
cases are left to the reader as Exercise 2.25. 


triangle in a rectangle. 


Pick’s theorem allows us to count not only the lattice points strictly inside 
the polygon P, but also the total number of lattice points contained in P, 
because this number is 


1 1 
I+B=A-SB+1+B=A+>5B+1. (2.15) 


From this identity, it is now easy to describe the lattice-point enumerator Lp: 


Theorem 2.9. Suppose P is an integral convex polygon with area A and B 
lattice points on its boundary. 


(a) The lattice-point enumerator of P is the polynomial 
>, 1 
p(t) =A? + 5Bttl. 
(b) Its evaluation at negative integers yields the relation 


ipl s Leal): 


(c) The Ehrhart series of P is 


B+1)2+(A+ 3-2) 24 
Ehrp(z) _ (A 2% 1) (A 2 2) a 
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Note that in the numerator of the Ehrhart series, the coefficient of z? is 
L[po(1), and the coefficient of z is Lp(1) — 3. 


Proof. Statement (a) will follow from (2.15) if we can prove that the area of 
tP is At? and that the number of boundary points on ¢tP is Bt, which is the 
content of Exercise 2.26. Statement (b) follows with Lpo(t) = Lp(t) — Bt. 
Finally, the Ehrhart series is 


Ehrp(z) =1+ 5_ Lp(t) 2! 


t>1 
B 
t>0 
23) 2+e2 iF z | 1 


(l—z)3 2 (1-2z)? 1l-z 
(A—241) 224+ (A+ -2)2+1 
(lea)? 


2.7 Polygons with Rational Vertices 


In this section we will establish formulas for the number of integer points in a 
rational convex polygon and its integral dilates. 

A natural first step is to fix a triangulation of the polygon P, which reduces 
our problem to that of counting integer points in rational triangles. However, 
this procedure merits some remarks. After counting lattice points in the 
triangles, we need to reassemble the triangles to form the polygon. But then 
we need to take care of the overcounting on line segments (where the triangles 
meet). Computing the number of lattice points on rational line segments is 
considerably easier than enumerating lattice points in 2-dimensional regions; 
however, it is still nontrivial (see Theorem 1.5). 

After triangulating P, we can further simplify the picture by embedding an 
arbitrary rational triangle in a rational rectangle, as in Figure 2.8. To compute 
lattice points in a triangle, we can first count the points in a rectangle with 
edges parallel to the coordinate axes, and then subtract the number of points 
in three right triangles, each with two edges are parallel to the axes, and 
possibly another rectangle, as shown in Figure 2.8. Since rectangles are easy 
to deal with (see Exercise 2.2), the problem reduces to finding a formula for a 
right triangle two of whose edges are parallel to the coordinate axes. 

We now adjust and expand our generating-function machinery to these 
right triangles. Such a triangle T is a subset of R? consisting of all points 
(x, y) satisfying 


g>— 9S >, ee-+ fy sr 


als 
Qala 
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for some integers a,b,d,e, f,r (with ea + fb < rd; otherwise, the triangle 
would be empty). Because the lattice-point count is invariant under horizontal 
and vertical integer translations and under flipping about the z- or y-axis, 
we may assume that a,b,d,e, f,r > 0 and a,b < d. (One should meditate 
about this fact for a minute.) Thus we arrive at the triangle T depicted in 
Figure 2.9. 


Fig. 2.9 A right rational triangle. 


To make our life a little easier, let’s assume for the moment that e and f 
are relatively prime; we will deal with the general case in the exercises. So let 


T={(u)eRi 225, y> 5, cot tysrh. (2.16) 


To derive a formula for 


ta 
d 


x(t) =#{ (mn) eZ? m> n> F emt inserh, 


we want to use methods similar to those in Chapter 1. As in Section 2.3, we 
introduce a slack variable s: 


tb 
Lr (t) = #{ (mn) € 2? me 4,n>F,em+snser} 
a m,n, s) € Z3 pe ae SU: em+fn+s=tr>. 
d d 


This counting function can now, as earlier, be interpreted as the coefficient of 
2” in the function 
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> | {| (re 


t b 
m> n>? s20 


Here the subscript (e.g., m > 2) under a summation sign means sum over 
all integers satisfying this condition. For example, in the first sum, we begin 
with the least integer greater than or equal to “*, which is denoted by | “¢] 
(and is equal to | +*| +1 by Exercise 1.4(j)). Hence the above generating 
function can be rewritten as 


em n s [Fle au 1 
= ° 2 ee dU? er er, 1l-z 


mf] J Anaya 7 \ez0 


where we have introduced, for ease of notation, 
t tb 
i= | e€ and Vis 5 ie (2.18) 


To extract the coefficient of 2” of our generating function (2.17), we use 
familiar methods. As usual, we shift this coefficient to a constant term: 


guru—tr 
Lr(t) = const (z 2°) (1 zt) (1 5) 


= const (7 2) (1 a ( sas) 


Before we apply the partial fraction machinery to this function, we should 
make sure that it is indeed a proper rational function, that is, that the total 
degree satisfies 


u+tv—tr—e—f—-1<0 (2.19) 


(see Exercise 2.33). Then we expand into partial fractions (here we are using 
our assumption that e and f do not have any common factors): 


(1— 2°) (1 — 2f) (1 — z)ztr-uv 


e-1 A. f-1 ; 3 oe _ 
=) as ae he D p+ 3, (2.20) 


As we have seen numerous times before, the coefficients D; do not contribute 
to the constant term, so that we obtain 
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e—1 A. f-1 B 
E7t)=-)> 4 - > a -GAGa-G. (2:21) 
We invite the reader to compute the coefficients appearing in this formula 
(Exercise 2.34): 
eet 
e(1-a/) a-a) 
l(u—tr+1) 
Sf 
f(1-eF) -&) 


C= | 1 i : 7) +3(: | 1 1) 


Aj = 


B= 


2ef 2 ef e f a FG 
Life, 1 f 
3(G att). (2.22) 
Cy = wte-te+d | 1 1 
a ef Oe oF? 
1 
a= oF 


Putting these ingredients into (2.21) yields the following formula for our 
lattice-point count. 


Theorem 2.10. For the rectangular rational triangle T given by (2.16), where 
e and f are relatively prime, 


Lr(t) = : (tr —u Peo u M(e+5 =) 


122 gem) Pp ie. gee tr) 
pees) TE Tae 


This identity can be rephrased in terms of the Fourier-Dedekind sum that 
we introduced in (1.13): 


ss Syste Ts 1; e) ar Bite B 1; i) : 
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The general formula for [7—not assuming that e and f are relatively 
prime—is the content of Exercise 2.36. 

Let’s pause for a moment and study the nature of [7 as a function of t. 
Aside from the last two finite sums (which will be put in the spotlight in 
Chapter 8) and the appearance of u and v, the function L7 is a quadratic 
polynomial in ¢. And in those two sums, t appears only in the exponent of 
roots of unity, namely as the exponent of € and €f. As a function of t, €! is 
periodic with period e, and similarly, a is periodic with period f. We should 
also remember that u and v are functions of t; but they can be easily written 
in terms of the fractional-part function, which again gives rise to periodic 
functions in t. So £7(t) is a (quadratic) “polynomial” in t, whose coefficients 
are periodic functions in t. This is reminiscent of the counting functions of 
Chapter 1, which showed a similar periodic-polynomial behavior. Inspired by 
both examples, we define a quasipolynomial @ as an expression of the form 


Q(t) = cn(t) t” +--+ + a(t) t+ c(t), 
where Co,...,Gn are periodic functions in t. The degree of Q is n,° and 
the least common period of co,...,Cn is the period of Q. Alternatively, 
for a quasipolynomial Q, there exist a positive integer k and polynomials 
P0;Pi5-++;Pe—1 Such that 


po(t) if t=0 mod k, 
pr(t) ift=1 mod k, 


pe-i(t) ift=k—1modk. 


The minimal such k is the period of Q, and for this minimal k, the polynomials 
Po; P1,---,Pr—-1 are the constituents of Q. 

By the triangulation and embedding-in-a-box arguments that began this 
section, we can now state a general structural result for rational polygons. 


Theorem 2.11. Let P be any rational polygon. Then Lp(t) is a quasipolyno- 
mial of degree 2. Its leading coefficient is the area of P (in particular, it is a 
constant). 


We have the technology at this point to study the period of Dp; we let the 
reader enjoy the ensuing details (see Exercise 2.37). 


Proof. By Exercises 2.2 and 2.36 (the general form of Theorem 2.10), the 
theorem holds for rational rectangles and right triangles whose edges are 
parallel to the axes. Now use the additivity of both degree-2 quasipolynomials 
and areas along with Theorem 1.5. 


5 Here we tacitly assume that cy is not the zero function. 
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2.8 Euler’s Generating Function for General Rational 
Polytopes 


By now, we have computed several instances of counting functions by setting 
up a generating function that fits the particular problem in which we are 
interested. In this section, we set up such a generating function for the lattice- 
point enumerator of an arbitrary rational polytope. Such a polytope is given 
by its hyperplane description as an intersection of half-spaces and hyperplanes. 
The half-spaces are algebraically given by linear inequalities, the hyperplanes 
by linear equations. If the polytope is rational, we can choose the coefficients 
of these inequalities and equations to be integers (Exercise 2.7). To unify 
both descriptions, we can introduce slack variables to turn the half-space 
inequalities into equalities. Furthermore, by translating our polytope into the 
nonnegative orthant (we can always shift a polytope by an integer vector 
without changing the lattice-point count), we may assume that all points in 
the polytope have nonnegative coordinates. In summary, after a harmless 
integer translation, we can describe every rational polytope P as 


P={xe RS): Ax=b} (2.23) 


for some integral matrix A € Z'*¢ and some integer vector b € Z™. (Note 
that d is not necessarily the dimension of P.) To describe the t'® dilate of P, 
we simply scale a point x € P by 4, or alternatively, multiply b by t: 


tp = {xeRdy: AZ =b} = {xe Rdg: Ax=tb}. 
Hence the lattice-point enumerator of P is the counting function 
Lp(t) =# {x €ZS,: Ax=tb}. (2.24) 


Example 2.12. Suppose P is the quadrilateral with vertices (0,0), (2,0), 
(1,1), and (0, 3) pictured in Figure 2.10. The half-space-inequality description 
of P is 
< 
P= {(ex2) € R? : @1,%2 = 0, os Teh S. ae 


Ly +x <2 


Thus, 


7 2. L1 + 2x2 < 3t, 
Lp(t) = {(ewan) €2 + ty ta Dy ob ay < OF 


(%1, %2,%3, £4) e Ve + @1,%2,%3,04 = 0, 


{reate: (1752) *=(8)} 


t+ 202+ x3 = 3t, 
1 +%.4+%4 = 2t 
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(1,1) 


Fig. 2.10 The quadrilat- (0, 0) (2, 0) 
eral P from Example 2.12. 


Using the ideas from Sections 1.3, 1.5, 2.3, and 2.7, we now construct a 
generating function for this counting function. In those previous sections, the 
lattice-point enumerator could be described with only one nontrivial linear 
equation, whereas now we have a system of such linear constraints. However, 
we can use the same approach of encoding the linear equation into geometric 
series; we just need more than one variable. When we expand the function 


1 
(1 — 2122) (1 — 2222) (1 — 21) (1 — 22) 2423" 


f (z1, 22) = 


into geometric series, 


f (21, 22) = 


n n n n 1 
( ~~ (2122) ; ( S- (2722) ( y a) ( yy =) pate 
ni>0 n2>0 n3>0 na>0 1 “2 


= S gn +2n2+n3 ae +n2+na 2b 


N1,-.-,na>0 


When we compute the constant term (in both z; and z2), we are counting 
solutions (n1,n2,n3,n4) € Zh, to 


1210\] no) _ (3¢ 
1101) ] ng | \2¢)? 
that is, the constant term of f (21, z2) counts the integer points in P: 


1 
(1 — 2122) (1 — 2229) (1 — 21) (1 — 22) zft23t 


Lp(t) = const 


50 2 A Gallery of Discrete Volumes 


We invite the reader to actually compute this constant term (Exercise 2.38). 
It turns out to be 


7.5 7+(-1)' 


t t4 
4° +3 8 
Returning to the general case of a polytope P given by (2.23), we denote 
the columns of A by c1,¢2,...,¢g. Let 2 = (2, 22,..-,%m) and expand the 
function i 
(2.25) 


(1 — 201) (1 —702)---(1 — Ze) 7th 


in terms of geometric series: 


grier gn2e2 |... graced 1 ; 
> > > zib 


ni 20 n220 na>O0 
Here we use the abbreviating notation z* := z{1z$?---z@™ for the vectors 
Z= (21, 22,---,2m) € C™ and a= (a1, a2,...,4@m) € Z™. In multiplying out 


everything, a typical exponent will look like 
mye, +NgCg +--+ +ngeg — tb = An— tb, 


where n = (ni, n2,...,Na) € Zo. That is, if we take the constant term of our 
generating function (2.25), we are counting integer vectors n € Z{, satisfying 


An-—tb=0, that is, An= tb. 


So this constant term will pick up exactly the number of lattice points n € ZS 
in tP: ~ 


Theorem 2.13 (Euler’s generating function). Suppose the rational poly- 
tope P is given by (2.23). Then the lattice-point enumerator of P can be 
computed as follows: 


1 
(1— 2%) (1 Se) . 


We finish this section with rephrasing this constant-term identity in terms 
of Ehrhart series. 


Lp(t) = const ( 


Corollary 2.14. Suppose the rational polytope P is given by (2.23). Then 
the Ehrhart series of P can be computed as 


1 
Ehrp(x) = const (a — 2¢1) (1 — 2°) --- (1 — 2&2) (i _ 5) : 


Proof. By Theorem 2.13, 
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Notes 


1. Convex polytopes are beautiful objects with a rich history and interesting 
theory, which we have only glimpsed here. For good introductions to poly- 
topes, we recommend [68,126,258]. Polytopes appear in a vast range of current 
research areas, including Grobner bases and commutative algebra [236], com- 
binatorial optimization [95,213], integral geometry [149], K-theory [73], and 
geometry of numbers [220]. 


2. The distinction between the vertex and hyperplane description of a convex 
polytope leads to an interesting algorithmic question; namely, how quickly can 
we retrieve the first piece of data from the second and vice versa [213, 258]? 


3. Ehrhart series are named after Eugéne Ehrhart (1906-2000),° who proved 
the main structure theorems which we will see in Chapter 3. The Ehrhart 
series of a polytope is an example of a Hilbert-Poincaré series. These series 
appear in the study of graded algebras (see, for example, [134,229]); in the 
Ehrhart case, this algebra lives in C[zf', z>',..., za ' za+1| and is generated 
by the monomials z™, where m ranges over all integer points in cone(P), the 
cone over P, which we will define in Chapter 3. Ehrhart series also appear in 
the context of toric varieties, a vast and fruitful subject [91,116]. 


4. The Eulerian numbers A(d,k) are named after Leonhard Euler (1707— 
1783)" and arise naturally in the statistics of permutations: A (d,k) counts 
permutations of {1,2,...,d} with & — 1 ascents. For more on A (d,k), see [87, 
Section 6.5] and [139]; for more connections between A (d,k) and Ehrhart 
theory, see [32]. 


5. The pyramids of Section 2.4 have an interpretation as order polytopes [230]. 
A curious fact about the lattice-point enumerators of these pyramids is that 
they have arbitrarily large real roots as the dimension grows [36]. 


6 For more information about Ehrhart, see http://icps.u-strasbg.fr/~clauss/Ehrhart .htm1. 


7 For more information about Euler, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Euler.html. 
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6. The counting function Lo for the cross-polytope can, incidentally, also be 


written as 
min(d,t) 


> (5) (a) 


In particular, Zo is symmetric in d and t. The cross-polytope counting 
functions bear a connection to Laguerre polynomials, the d-dimensional 
harmonic oscillator, and the Riemann hypothesis. This connection appeared 
in [75], where Daniel Bump, Kwok—Kwong Choi, Par Kurlberg, and Jeffrey 
Vaaler also found a curious fact about the roots of the polynomials Do: they 
all have real part —4 (an instance of a local Riemann hypothesis). This fact 
was proved independently by Peter Kirschenhofer, Attila Pethé, and Robert 
Tichy [148]; see also the notes in Chapter 4. 


7. Theorem 2.8 marks the beginning of the general study of lattice-point 
enumeration in polytopes. Its amazingly simple statement was discovered 
by Georg Alexander Pick (1859-1942)® in 1899 [190]. Pick’s theorem holds 
also for a nonconvex polygon, provided its boundary forms a simple curve. 
In Chapter 14, we prove a generalization of Pick’s theorem that includes 
nonconvex curves. 


8. The results of Section 2.7 appeared in [43]. We will see in Chapter 8 that 
the finite sums over roots of unity can be rephrased in terms of Dedekind— 
Rademacher sums, which—as we will also see in Chapter 8—can be computed 
very quickly. The theorems of Section 2.7 will then imply that the discrete 
volume of every rational polygon can be computed efficiently. 


9. If we replace tb in (2.24) by a variable integer vector v, the counting 
function 
fw) =#{xe€Z$,: Ax=v} 


is called a vector partition function: it counts partitions of the vector v in 
terms of the columns of A. Vector partition functions are the multivariate 
analogues of our lattice-point enumerators Lp(t). They have many interesting 
properties and give rise to intriguing open questions [30, 58, 90, 235, 240). 


10. While Leonhard Euler most likely did not think of lattice-point enumer- 
ation in the sense of Ehrhart, we attribute Theorem 2.13 to him, since he 
certainly worked with generating functions of this type, probably thinking of 
them as vector partition functions. Perey MacMahon (1854-1929) developed 
powerful machinery for manipulating multivariate generating functions [167]; 
his viewpoint and motivation came from integer partitions, but his work 


8 For more information about Pick, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Pick.html. 


9 For more information about MacMahon, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/MacMahon. html. 
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can be applied to more general linear-constraint settings, such as vector 
partition functions. The potential of Euler’s generating function for Ehrhart 
polynomials was already realized by Ehrhart [109,111]. Several modern ap- 
proaches to computing Ehrhart polynomials are based on Theorem 2.13 (see, 
for example, [29, 67, 159]). 


Exercises 


2.1. & Fix positive integers a, b,c,d such that ged(a, b) = ged(c,d) = 1 and 

¢ < £, and let P be the interval [¢,$] (so P is a 1-dimensional rational 

convex polytope). Compute Lp(t) = #(tP MZ) and Lpe(t) and show directly 

that Dp(t) and Lpo(t) are quasipolynomials with period lem(b, d) that satisfy 
Lpo (-t) => —Lp (t) . 

(Hint: Exercise 1.4(i).) 

2.2. & Fix positive rational numbers aj, b;,a2,b2 and let R be the rectan- 


gle with vertices (a1, 61), (@2, 61), (a2, be), and (a), bz). Compute L(t) and 
Ehrp(z). 


2.3. Fix positive integers a and b, and let T be a triangle with vertices (0,0), 
(a, 0), and (0,8). 


(a) Compute L(t) and Ehry(z). 
(b) Use (a) to derive the following formula for the greatest common divisor of 
a and b: 


b-1 
ka 

rcd(a,b) = 2 —|+a+b—-ab. 

ged(a,b) Sl 


(Hint: Exercise 1.12.) 


2.4. Prove that for two polytopes P Cc R™ and QO Cc R”, 
#((P x QDNZ™") = #(PNZ™)-#(QNZ"). 
Hence, Lpxa(t) = Lp(t) Lo(t). 


2.5. Prove that if F is a face of P and G is a face of F, then G is also a face 
of P. (That is, the face relation is transitive.) 


2.6. & Suppose A is a d-simplex with vertices V = {v1, v2,...,Va+1}. Prove 
that for every nonempty subset W C V, conv W is a face of A, and conversely, 
that every face of A is of the form conv W for some W C V. Conclude the 
following corollaries from this characterization of the faces of a simplex: 


(a) A face of a simplex is again a simplex. 
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(b) The intersection of two faces of a simplex A is again a face of A. 


2.7. & Prove that a rational convex polytope can be described by a system 
of linear inequalities and equations with integral coefficients. 


2.8. & Prove the properties (2.3) of the Eulerian numbers for all integers 
1<k<d, namely: 


(a) A(d,k) = A(d,d+1—k) 
(b) A(d,k) =(d—k+1) A(d—1,k—1) +k A(d—-1,k) 


2.9. & Prove (2.6); namely, for d > 0, Toa = = eo Wye 
2.10. & Prove (2.7): For t,k € Z and d € Zyo, 


c(t") _ earns 


2.11. The Stirling numbers of the first kind, stirl(n,m), are defined 
through the finite generating function 


u(a—1)---(a—n+1) a 


m=0 


(See also [1, Sequence A008275].) Prove that 


d 
1 \i-k d+t 
au stirl(d + 1,k+1)t* =( d ), 


the lattice-point enumerator for the standard d-simplex. 


2.12. Give a direct proof that the number of solutions (m1, mz2,...,7™a41) € 
i tomy +m2+-+-+ma+1 = t equals Gar (Hint: think of t objects lined 
up and separated by d walls.) 


2.13. Compute Lp(t), where P is the regular tetrahedron with vertices 
(0, 0, 0) o) (1, 1, 0) o) (1, 0, 1) ? (0, iL, Il), 


2.14. & Prove that the power series 


that defines the Bernoulli numbers has radius of convergence 27. 
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2.15. & Prove (2.11); namely, Ba(1 — x) = (—1)¢Ba(z). 
2.16. & Prove (2.12); namely, By = 0 for all odd d> 3. 


2.17. Show that for each positive integer n, 


na”! = Ss @ Bn—k(2). 


k=1 


This gives us a change of basis for the polynomials of degree < n, allowing us 
to represent every polynomial as a sum of Bernoulli polynomials. 


2.18. As a complement to the previous exercise, show that we also have a 
change of basis in the other direction. Namely, we can represent a single 
Bernoulli polynomial in terms of the monomials as follows: 


B,(a) = 3 & B,a?-*, 


k=0 


2.19. Show that for all positive integers m,n and for all x € R, 


(This is a Hecke-operator-type identity, originally found by Joseph Ludwig 
Raabe in 1851.) 


2.20. Show that B,(2+1)—B,(x) =na"1. 


2.21. An alternative way to define the Bernoulli polynomials is to give ele- 
mentary properties that uniquely characterize them. Show that the following 
three properties uniquely determine the Bernoulli polynomials, as defined in 
the text by (2.8): 

(a) Bo(a) = 1. 

(b) aBn(2) =nB,-1(2), for alln > 1. 


(c) te B, (x) dx = 0, for all n > 1. 


2.22. Use (2.13) to derive the following identity, which expresses a Bernoulli 
polynomial in terms of Eulerian numbers and binomial coefficients: 


re?) 


5 (Balt +2) ~ Ba) = A(d 1,41) ( : 


+ Ald —1,d=2) a jee Aidit) ({): 


2.23. & Prove Theorem 2.6: Ehrpipy:(g)(z) = #2 Ehro(z). 


=e 
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2.24. A Delannoy path is a path through lattice points in the plane with 
steps (1,0), (0,1), and (1,1) (ie., “east,” “north,” and “northeast”). Find a 
recurrence for the number D(m,n) of Delannoy paths from the origin to the 
point (m,n), and use it to compute the two-variable generating function 


m nm 1 
Dd Oe a, 


m>0 n>0 


Conclude from this generating function that the Ehrhart polynomial Lo (t) 
of the d-dimensional cross-polytope equals D(t,d). (Hint: start with the d‘® 
derivative of the generating function of D(t,d) with respect to w.) 


2.25. & Let R be an integral rectangle whose edges are parallel to the 
coordinate axes, and let 7 be a rectangular triangle two of whose edges are 
parallel to the coordinate axes. Show that Pick’s theorem holds for R and T. 


2.26. & Suppose P is an integral polygon with area A and with B lattice 
points on its boundary. Show that the area of tP is At?, and the number of 
boundary points on ¢P is Bt. (Hint: Exercise 1.12.) 


2.27. Let P be the self-intersecting polygon defined by the line segments 
((0, 0), (4, 2)], [(4, 2), (4, 0)], [(4, 0), (0, 2)], and [(0, 2), (0,0)]. Show that Pick’s 
theorem does not hold for P. 


2.28. Suppose that P and Q are integral polygons, and that Q lies entirely 
inside P. Then the area bounded by the boundaries of P and Q, denoted 
by P — Q, is a “doubly connected polygon.” Find and prove the analogue 
of Pick’s theorem for P — Q. Generalize your formula to a polygon with n 
“holes” (instead of one). 


2.29. Show that every convex integral polygon with more than four vertices 
must have an interior lattice point. 


2.30. Consider the rhombus 
R= {(x,y) : ala] + bly| < ab}, 
where a and b are fixed positive integers. Find a formula for L(t). 


2.31. We define the nt" Farey sequence to be the sequence, in order from 


smallest to largest, of all the rational numbers $ in the interval [0,1] such 


that a and b are coprime and b < n. For instance, the sixth Farey sequence is 
Oliirii2i1323451 


1? 6? 5? 47395? 275739495 6 1° 

(a) For two consecutive fractions ¢ and § in a Farey sequence, prove that 
be -—ad=1. 

(b) For three consecutive fractions $, § 


d@ 
that § = TF. 


and 5 in a Farey sequence, show 
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2.32. Let [x] denote the smallest integer greater than or equal to x. Prove 
that for all positive integers a and 0), 


a b 


a+(-1P $0 (-Dl = b+ (-12 7-11 mod 4. 


m=0 n=0 


(Hint: This is a variation of Exercise 1.6. One way to obtain this identity is by 
counting lattice points in a certain triangle, keeping track only of the parity.) 


2.33. & Verify (2.19). 
2.34. & Compute the partial fraction coefficients (2.22). 
2.35. & Let a,b be positive integers. Show that 


1 : ky—-1 , ab— 
(@-&) + 2ab 


2.36. & Let 7 be given by (2.16), and let c = gcd(e, f). Prove that 


1 ; 
+ terms with positive powers of (z - £F) : 


Ir(t)= a5 ltr-u-vy +5 (tr—u 2 aa i) 


of 
ee tfe £4 
"Al ar eerie | — 


il 1 utv-—tr 3 eo aa ot) 
2e | 2f ef fis ef 4 
1 e—1 pee) 1 f-1 ele iad 


tel oe 


2.37. Let P be a rational polygon, and let d be the least common multiple of 
the denominators of the vertices of P. Prove directly (using Exercise 2.36) 
that the period of Lp divides d. 


2.38. & Finish the calculation in Example 2.12, that is, compute 


1 
(1 — 2122) (1 — 2222) (1 — 21) (1 — 22) aft edt” 


const 


2.39. Compute the vector partition function of the quadrilateral given in 
Example 2.12, that is, compute the counting function 


Fon) = {ice thy: (1229) x= (2) 


for v1, v2 € Z. (This function depends on the relationship between v1 and v2.) 
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2.40. Search on the Internet for the program polymake [118]. You can down- 
load it for free. Experiment. 


Open Problems 


2.41. Choose d+ 1 of the 2% vertices of the unit d-cube O, and let A be the 
simplex defined by their convex hull. 


(a) Which choice of vertices maximizes vol A? 
(b) What is the maximum volume of such a A? 


2.42. Find classes of integral d-polytopes (Pa) 75, for which Lp, (t) is sym- 
metric in d and t. (The standard simplices A and the cross-polytopes © form 
two such classes.) 


2.43. We mentioned already in the notes that all the roots of the polynomials 
Le have real part —t; see [75,148]. Find other classes of polytopes whose 
lattice-point enumerator exhibits such special behavior. 


Chapter 3 


Counting Lattice Points in Polytopes: 
The Ehrhart Theory 


Ubi materia, ibi geometria. 


Johannes Kepler (1571-1630) 


Fig. 3.1 Self-portrait of 4989 
Eugéne Ehrhart. a“ 


Given the profusion of examples that gave rise to the polynomial behavior 
of the integer-point counting function Lp(t) for special polytopes P, we now 
ask whether there is a general structure theorem. As the ideas unfold, the 
reader is invited to look back at Chapters 1 and 2 as appetizers and indeed 
as special cases of the theorems developed below. 


3.1 Triangulations 


Because most of the proofs that follow work like a charm for a simplex, we first 
dissect a polytope into simplices. This dissection is captured by the following 
definition. 
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A triangulation of a convex d-polytope P is a finite collection T of 
d-simplices with the following properties: 


eo P=[JA. 


AET 
e For every A;, Ag € T, Ay M Ag is a face common to both A, and Ag. 
Figure 3.2 exhibits two triangulations of the 3-cube. We say that P can be 
triangulated using no new vertices if there exists a triangulation JT’ such 
that the vertices of every A € T are vertices of P. 


Fig. 3.2 Two (very different) triangulations of the 3-cube. 


Theorem 3.1 (Existence of triangulations). Every convex polytope can 
be triangulated using no new vertices. 


This theorem seems intuitively obvious, but it is not entirely trivial to prove. 
We will introduce a specific type of triangulation for a given polytope P, in the 
hope of achieving a maximum level of concreteness. Namely, given the polytope 
P C R¢ with vertices v1, v2,...,Vn, randomly choose hi, h2,...,hn € R, and 
define a new polytope 


Q := conv {(vi, h1), (v2, h2),.-+5 (Vn; tn) } C R*. (3.1) 


The lower hull of Q consists of all points that are “visible from below,” that 
is, all points (71,22,...,€a+41) € Q for which there is no € > 0 such that 
(@1,%2,...,a41 — €) € Q. A lower face of Q is a face of Q that is in the 
lower hull. Let 7 : R¢+! + R¢ be defined through 


(21, %9,--+,d,0d41) = (£1, 22,..-,2a); 


the projection that forgets the last coordinate. Exercise 3.3(a) implies that a 
lower face F projects to m(F) bijectively; in particular, F and 7(F) have the 
same face structure (we say that they are combinatorially equivalent). In 
a moment, we will show that all lower faces of Q are simplices, and that their 
projections form a triangulation of P. A triangulation that can be constructed 
from “lifting” a polytope in the above fashion is called regular. Figure 3.3 
shows an example. 
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Fig. 3.3 Constructing a regular triangulation of a 9-gon. 


Proof of Theorem 3.1. We may assume that P = conv {vj, V2,...,Vn} C R4 
is full dimensional. Construct a lifted polytope Q as in (3.1); our first goal is 
to show that every lower face of Q is a simplex. To achieve this goal, it suffices 
to prove that every lower facet is a simplex. This, in turn, is equivalent to the 
statement that every affinely independent set of d+ 1 vertices of P gets lifted 
to a set of d+ 1 points in R¢+! that determine a hyperplane H C R¢+! that 
does not contain any other lifted vertex of P. After relabeling, we may assume 
that the chosen d+ 1 affinely independent vertices are v,,Vv2,...,Va+1, and 
so H is given by the equation 


1 1 1 1 
det | Vi Vo -** Vat xX =0 (3.2) 
hy hg hati ®a41 
in the variables 71, 22,...,@a41. If we now specialize (x, ra41) = (vj, h,;) for 
some j > d+ 1, the determinant on the left-hand side of (3.2) cannot be zero, 
since hy, ho,...,ha41,h; were chosen randomly; in other words, the lifted 


vertex (vj,h;) is not on H for j > d+ 1. We have thus proved that every 
lower face of Q is a simplex. By our earlier bijective argument, 


T := {a(F): F is a lower face of Q} 


consists of simplices contained in P; we claim that T is a triangulation of P. 
To prove that P = ac A, we have “only” to show that P° C Unc A. 
So let x € P°; our goal is to construct a lower face F of Q such that x € 7(F). 
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Consider the line 
L:={x+Aea41: AER}, 


where eg+1 denotes the (d + 1)** unit vector; that is, 2 is a vertical line 
through x. Note that 
LAQVA#S, (3.3) 


since x € P°. So LM Q is a line segment with endpoints (x,y) and (x, z) for 
some y < z. Since (x,y) is on the boundary of Q, it is contained in some 
(proper) face F of Q. Thus we can find a supporting hyperplane 


H={perR™:a-p=6} 


that defines F such that Q C {p € R“*!: a- p > b}. Note that (x, z) cannot 
lie on H, since otherwise, the entire line segment from (x, y) to (x, z) would 
be in H, contradicting (3.3). Thus 


a: (x,y) =) and a-(x,z) >), 


which implies agyi(z — y) > 0 and so aq4; > 0. But then we can use 
Exercise 3.3(b) to conclude that F is a lower face of Q. By construction, 
x € m(F), which proves P C Uacr A. 

That the second defining property for a triangulation holds for T, namely 
that for every A;, Ag € T, Ay M Ag is a face common to both A, and Ag, is 
left to show in Exercise 3.3(c). 


3.2 Cones 


A pointed cone K C R? is a set of the form 
K = {v + Arwi + Agwe +--+ +AmWm : M1, A2;---;Am = OF," 


where v,W1,W2,...,Wm € R®@ are such that there exists a hyperplane H 
for which HNK = {v}; that is, K \ {v} lies strictly on one side of H. The 
vector v is called the apex of K, and the w, are the generators of K. The 
cone is rational if v,w,,w2,...,Wm € Q%, in which case we may choose 
W1,W2,...,Wm € Z! by clearing denominators. The dimension of K is the 
dimension of the affine space spanned by K; if K is of dimension d, we call it a 
d-cone. The d-cone K is simplicial if K has precisely d linearly independent 
generators. 

Just as polytopes have a description as an intersection of half-spaces, so 
do pointed cones: a rational pointed d-cone is the d-dimensional intersection 
of finitely many half-spaces of the form 


{x ER: a, 21 + agrg +++: + agra < b} 
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with integral parameters a), d@2,...,a@¢,b € Z such that the corresponding 
hyperplanes of the form 


{xe R?: a2) + gt +--+ + agra = b} 


meet in exactly one point. 

Cones are important for many reasons. The most practical for us is a 
process called coning over a polytope. Given a convex polytope P C R? with 
vertices V1, V2,-..,Vn, we lift these vertices into R¢+! by adding a 1 as their 
last coordinate. So, let 


wi =(vi,1), we = (va,1), ..., Wn = (Vn, 1). 
Now we define the cone over P as 
cone(P) = {Ayw, + Agwe +:°* + AnWn : Ai, Ag,---;An > 0} CR. 
This pointed cone has the origin as apex, and we can recover our original 


polytope P (strictly speaking, the translated set {(x,1): x € P}) by cutting 
cone(P) with the hyperplane xg11 = 1, as shown in Figure 3.4. 


v3 


N 
v2 


v1 
Fig. 3.4 Coning over a polytope. 


By analogy with the language of polytopes, we say that the hyperplane 
H ={x ¢R¢: a-x =b} isa supporting hyperplane of the pointed d-cone 
K if K lies entirely on one side of H, that is, 
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KC {xe Rt: a-x<b} or KC{xeRt:a-x>bd}. 


A face of K is a set of the form KM H, where H is a supporting hyperplane 
of K. The (d — 1)-dimensional faces are called facets, and the 1-dimensional 
faces edges, of K. The apex of K is its unique 0-dimensional face. 

Just as polytopes can be triangulated into simplices, pointed cones can be 
triangulated into simplicial cones. So, a collection T' of simplicial d-cones is a 
triangulation of the d-cone K if it satisfies the following: 


ek=Us. 


SET 
e For every S1,S2 € T, S; 1S is a face common to both S; and Sp. 


We say that K can be triangulated using no new generators if there 
exists a triangulation T’ such that the generators of every S € T are generators 
of K. 


Theorem 3.2. Every pointed cone can be triangulated into simplicial cones 
using no new generators. 


Proof. This theorem is really a corollary to Theorem 3.1. Given a pointed 
d-cone K with apex v, there exists a hyperplane H that intersects K only 
at v. Choose w € K°; then 


P:=(w-v+HA)nK 


is a (d— 1)-polytope whose vertices are determined by the generators of K. 
(This construction yields a variant of Figure 3.4.) Now triangulate P using no 
new vertices. Each simplex A; in this triangulation gives naturally rise to a 
simplicial cone 

S):={vt+Ax:A>0, xe Aj}, 


and these simplicial cones, by construction, triangulate K. 


3.3 Integer-Point Transforms for Rational Cones 


We want to encode the information contained by the lattice points in a set 
S CR? It turns out that the following multivariate generating function allows 
us to do this in an efficient way if S is a rational cone or polytope: 


og(Z) =og (21, Z2,-+-, Za) — > zm. 
mesnZzé 


The generating function og simply lists all integer points in S in a special 
form: not as a list of vectors, but as a formal sum of Laurent monomials. We 
call og the integer-point transform of S; the function og also goes by the 
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name moment generating function or simply generating function of S. The 
integer-point transform og opens the door to both algebraic and analytic 
techniques. 


Example 3.3. As a warmup example, consider the 1-dimensional cone K = 
(0, co). Its integer-point transform is our old friend 


1 


Ox (Zz) = »; Cm 


mE€[0,00)NZ m>0 


Example 3.4. Now we consider the 2-dimensional cone 
K := {Ai (1, 1) + A2(—2,3) : Ar, A2 > OF} C R? 


depicted in Figure 3.5. To obtain the integer-point transform ox, we tile K 


rp 
fe} ° fe) fe) fe) fe) fe} 
fe} fe) ° fe) ° fe) fe} 
° ° fo) ° fo) fo} 

K 
fe} ° fe) ° ° fe) fe) 


Fig. 3.5 The cone K from Example 3.4 and its fundamental parallelogram. 


by copies of the fundamental parallelogram 
T= {Ai 1) + A2(—2, 3) -O0< Ay, A2 < 1} Cc R?. 


More precisely, we translate II by nonnegative integer linear combinations of 
the generators (1,1) and (—2,3), and these translates will exactly cover K. 
How can we list the integer points in K as Laurent monomials? Let’s first list 
all vertices of the translates of II. These are nonnegative integer combinations 
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of the generators (1,1) and (—2,3), so we can list them using geometric series: 


f 1 
m _ JL) +k(-2,3) _ 
pS : ne, (1 — 2122) (L— 27723) © 


m=j(1,1)+k(-2,3) 720 D0 
jk>0 


We now use the integer points (m,n) € II to generate a subset of Z? by adding 
to (m,n) nonnegative linear integer combinations of the generators (1,1) and 
(—2,3). Namely, we let 


Lomn) = {(m,n) +911, 1) + k(-2,3): 7, k € Zo}. 


It is immediate that KM Z? is the disjoint union of the subsets Lon») as 
(m,n) ranges over ITM Z? = {(0,0), (0,1), (0,2), (—1, 2), (—1,3)}. Hence 


ox(z) = (l+z2+23+2)'23 + 27123) S- zm 


m=j(1,1)+k(—2,3) 
j,k20 


-1 -1 
ltt 2+2p +2723 


(1 — 2122) (1 — 2 ee) 


Similar geometric series suffice to describe integer-point transforms for 
rational simplicial d-cones. The following result utilizes the geometric series 
in several directions simultaneously. 


Theorem 3.5. Suppose 
Ka= {Arwi + Agw2 +--+ +AawWa : At, A2,-.+5Ad > 0} 


is a simplicial d-cone, where w1,W2,...,Wa € Z*. Then for v € R4, the 
integer-point transform oy+k of the shifted cone v+K is the rational function 


0. K(z) = Ov+t(Z) 
v+. (1 —2™1) (1 —2¥2)---(1—2a)’ 


where II is the half-open parallelepiped 
TI := {Aywy + Agw2 +--+ AGwa: 0< A, A2,-++5Ad < 1}. 


The half-open parallelepiped II is called the fundamental parallelepiped 
of K. 


Proof. In oy4x(z) = dime (v+K)nZ4 z™, we list each integer point m inv+K 
as the Laurent monomial z™. Such a lattice point can, by definition, be 
written as 

m=v+Aiwi + Agw2 +--- + Aawa 


for some numbers ;,A9,...,Aq > 0. Because the w;, form a basis of R¢, 
this representation is unique. Let’s write each A, in terms of its integer and 
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fractional parts: Ay = |Ax| + {Ax}. So 
m = v+({Ar} wit {A2} wot: --+-{Aa} wa) +l Ar] wit Ae] wet: +a] wa, 
and we should note that since 0 < {A,} < 1, the vector 

p:=ve4t {Ar} wi t {Ao} we +--+ + {Aa} wa 
is in v+IL. In fact, p € Z%, since m and |, | wz are all integer vectors. Again, 
the representation of p in terms of the w; is unique. In summary, we have 


proved that every m € v + KM Z4 can be uniquely written as 


m= p+ kiwi + kow2 +--+ + kawa (3.4) 


for some p € (v + II) NZ? and some integers ky, k2,...,kg > 0. On the other 
hand, let’s write the rational function on the right-hand side of the theorem 
as a product of series: 


(l1- we. ZW) - 2 a ps gp S zea 


pe (v+I)nZz4 k,>0 ka>0 


If we multiply everything out, a typical exponent will look exactly like (3.4). 


Our proof contains a crucial geometric idea. Namely, we tile the cone v+K 
with translates of v + II by nonnegative integral combinations of the w,. It is 
this tiling that gives rise to the nice integer-point transform in Theorem 3.5. 
Computationally, we therefore favor cones over polytopes due to our ability to 
tile a simplicial cone with copies of the fundamental domain, as above. More 
reasons for favoring cones over polytopes appear in Chapters 10 and 11. 

Theorem 3.5 shows that the real complexity of computing the integer- 
point transform 0,4 is embedded in the location of the lattice points in the 
parallelepiped v + II. 

By mildly strengthening the hypothesis of Theorem 3.5, we obtain a slightly 
easier generating function, a result we shall need in Section 3.5 and Chapter 4. 


Corollary 3.6. Suppose 
K:= {Arw, + Agw2 +--+ +AawWa : Ay, A2,-++5Ad 2 O} 


is a simplicial d-cone, where w1,W2,...,Wa € Z%, and v € R®¢, such that the 
boundary of v + K contains no integer point. Then 


0. K(z) = Ov+t1(Z) 
v+. (1 —2™1) (1 —2¥2)---(1—2a)’ 


where II is the open parallelepiped 
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= {Ayw) + Agwe +--+ + AGwa: 0 < Aq, A2,.--,Aa < It. 


Proof. The proof of Theorem 3.5 goes through almost verbatim, except that 
v+TII now has no boundary lattice points, so that there is no harm in choosing 
II to be open. 


Since a general pointed cone can always be triangulated into simplicial 
cones, the integer-point transforms add up in an inclusion—exclusion manner 
(note that the intersection of simplicial cones in a triangulation is again a 
simplicial cone, by Exercise 3.5). Hence we have the following corollary. 


Corollary 3.7. For a pointed cone 
K = {v+Arwi + Awe +++ +AmWm : AL AQs0% 4 Am > 0} 


with v € R4, wi, W2,...,Wm € Z%, the integer-point transform ox(z) evalu- 
ates to a rational function in the coordinates of z. 


Philosophizing some more, one can show that the original infinite sum 
ox(z) converges only for z in a subset of C¢, whereas the rational function 
that represents ox gives us its meromorphic continuation. In Chapters 4 
and 11, we will make use of this continuation. 


3.4 Expanding and Counting Using Ehrhart’s Original 
Approach 


Here is the fundamental theorem concerning the lattice-point count in an 
integral convex polytope. 


Theorem 3.8 (Ehrhart’s theorem). If P is an integral convex d-polytope, 
then Lp(t) is a polynomial in t of degree d. 


This result is due to Eugene Ehrhart, in whose honor we call Lp the 
Ehrhart polynomial of ?. Naturally, there is an extension of Ehrhart’s 
theorem to rational polytopes, which we will discuss in Section 3.8. 

Our proof of Ehrhart’s theorem uses generating functions of the form 
Ys f(t) 2, similar in spirit to those discussed at the beginning of Chapter 1. 
If f is a polynomial, this power series takes on a special form, which we invite 
the reader to prove (Exercise 3.13): 


Lemma 3.9. Jf 


Swot = 


t>0 


then f is a polynomial of degree d if and only if g is a polynomial of degree at 
most d and g(1) £0. 
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The reason we introduced generating functions of the form os(z) = 
dimesnzeZ™ in Section 3.3 is that they are extremely handy for lattice- 
point problems. The connection to lattice points is evident, since we are 
summing over them. If we are interested in the lattice-point count, we simply 
evaluate og at z = (1,1,...,1): 


Gi awl) = YI = ois #enZ). 


meSnZ4 meéSNZ4 


(Here we denote by 1 a vector all of whose components are 1.) Naturally, we 
should make this evaluation only if S' is bounded; Theorem 3.5 already tells 
us that it is no fun evaluating oj¢(1) if K is a cone. 

But the magic of the generating function og does not stop there. To literally 
take it to the next level, we cone over a convex polytope P. If P C R?% has 
the vertices v1, V2,...,Vn € Z%, recall that we lift these vertices into R¢+! by 
adding a 1 as their last coordinate. So let 


wi = (vi, 1), We = (Vo, 1), s++, Wn =(vp,1). 
Then 
cone(P) = {Ayw1 + Agw2 +-++ + AnWn : At, Aa,---;An > 0} CR. 


Recall that we can recover our original polytope P by cutting cone(P) with 
the hyperplane rg, = 1. We can recover more than just the original polytope 
in cone(P): by cutting the cone with the hyperplane xg, = 2, we obtain 
a copy of P dilated by a factor of 2. (The reader should meditate on why 
this cut is a 2-dilate of P.) More generally, we can cut the cone with the 
hyperplane xg, = t and obtain tP, as suggested by Figure 3.6. 

Now let’s form the integer-point transform o¢one(p) of cone(P). By what we 
just said, we should look at different powers of za+1: there is one term (namely, 
1), with zi, ,, corresponding to the origin; the terms with zj,, correspond to 
lattice points in P (listed as Laurent monomials in 21, 22,..., 2a), the terms 
with 23 41 correspond to points in 2P, etc. In short, 


OF cone(P) (21, Qype ey Zd41) 
=1+op(2,..-,2a) Za¢1 + O2P (21, +++; 2a) S44 te 
= 1+) ow (Bi penn tee : 
t>1 
Specializing further for enumeration purposes, we recall that op (1,1,...,1) = 


#(PZ*), and so 
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Fig. 3.6 Recovering dilates of P in cone(P). 


Peone() pljaacy Lega) = 1+ > oy (Tyas) eee 
t>1 
=14+ > # (P24) #41. 
t>1 


But by definition, the enumerators on the right-hand side are just evaluations 
of Ehrhart’s counting function, that is, @conecpy (1, 1,.--,1, Za41) is nothing 
but the Ehrhart series of P: 


Lemma 3.10. Ogonep) (1, 1,-.--,1,2) = 1+ Lp(t) z' = Ehrp(z). 
(P) 
t>1 


With this machinery at hand, we can prove Ehrhart’s theorem. 


Proof of Theorem 38.8. It suffices to prove the theorem for simplices, because 
we can triangulate any integral polytope into integral simplices, using no new 
vertices. Note that these simplices will intersect in lower-dimensional integral 
simplices. 

By Lemma 3.9, it suffices to prove that for an integral d-simplex A, 


Ehra(z) = 1 +S°La(t) z= GQ—-2ze 


t>1 
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for some polynomial g of degree at most d with g(1) 4 0. In Lemma 3.10, 
we showed that the Ehrhart series of A equals g¢one(a) (1, 1,.--,1, 2), so let’s 
study the integer-point transform attached to cone(A). 

The simplex A has d+ 1 vertices v1, V2,..-,Va41, and so cone(A) Cc R¢+1 
is simplicial, with apex the origin and generators 


wi =(vi,1), wo =(vo,1), ---, Ways = (Vays, 1) € 2. 
Now we use Theorem 3.5: 


_ OTL (21, 22, s +) Zd41) 
Ocone(A) (21, 22,++-, Zd41) = fal — 71) (1 — 72) me (1 = zWai1) ’ 


where IT = {rAiw1 + AgWwg +---+ Nd+1Wa41 :0< M1, r2, seach Nd+1 < 1}. This 
parallelepiped is bounded, so the attached generating function oy is a Laurent 
polynomial in 21, Z2,..., Za+41- 

We claim that the zg41-degree of oy is at most d. In fact, since the xq41- 
coordinate of each wy is 1, the xg4-coordinate of a point in IT is Ay +Ag+---+ 
Aa+1 for some 0 < Aq, Ag,..-,Aa¢1 < 1. But then Ay +A2+--+-+Aaqq1 < dtl, 
so if this sum is an integer, it is at most d, which implies that the zq;)-degree 
of on (21, 22,---, Za41) is at most d. Consequently, on (1,1,...,1, za41) is a 
polynomial in z441 of degree at most d. The evaluation oy (1,1,1,...,1) of this 
polynomial at zg41 = 1 is not zero, because oy (1,1,1,...,1) = # (IINZ**"), 
and the origin is a lattice point in II. 


Finally, if we specialize zY* to 2] = 22 =--:- = zq = 1, we obtain aids so 
that 
OT] OL, 1, saree 1, Zd+1) 
AT cone(A) (My Deep ll, eae) = (1 Nay 
— 4d4+1 


The left-hand side is Ehra (za41) = 1+ 0,5, La(t) 24,1 by Lemma 3.10. 


3.5 The Ehrhart Series of an Integral Polytope 


We can actually take our proof of Ehrhart’s theorem one step further by 
studying the polynomial oy (1,1,...,1, za41). As mentioned above, the coef- 
ficient of 2% 4, Simply counts the integer points in the parallelepiped I cut 
with the hyperplane xg4, = k. Let’s record this. 


Corollary 3.11. Suppose A is an integral d-simplex with vertices v1, V2,.--, 
Vati, and let w; = (v;,1). Then 

Axzt the, 2b le thizt+hs 
7 (l—z)@1 , 


Ehra(z) =1+ 5° La(t) 2! 


t>1 


where hj, equals the number of integer points in 
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{Arwi + Agwe ++-> + Adt1 Way: O< Ai, A2,---, Adz < 1} 


with last coordinate equal to k. 


This result can actually be used to compute Ehra, and therefore the 
Ehrhart polynomial, of an integral simplex A in low dimensions very quickly 
(a fact that the reader may discover in some of the exercises). We remark, 
however, that things are not as simple for arbitrary integral polytopes. Not 
only is triangulation a nontrivial task in general, but one would also have to 
deal with overcounting where simplices of a triangulation meet. 

Corollary 3.11 implies that the numerator of the Ehrhart series of an 
integral simplex has nonnegative coefficients, since they count something. 
Although it is not known whether the coefficients of the Ehrhart series of 
a general polytope count something, the nonnegativity property magically 
survives. 


Theorem 3.12 (Stanley’s nonnegativity theorem). Suppose P is an 
integral convex d-polytope with Ehrhart series 


hi 24 + he get ++ +h5 
Ehrp(z) = (l = z)d+1 ; 


Then ho, hj,...,h% are nonnegative integers. 


We call the numerator h3(z) of the Ehrhart series Ehrp(z) the h*-po- 
lynomial of P.! Theorem 3.12 says that it always has nonnegative integer 
coefficients; we will have (much) more to say about h%(z) in Chapter 10. 


Proof. Triangulate cone(P) C R¢*+! into the simplicial cones K1,K2,...,Km. 
Exercise 3.19 ensures that there exists a vector v € R?+! such that 


cone(P) 1 Z**! = (v + cone(P)) NZ**1 


(that is, we neither lose nor gain any lattice points in shifting cone(P) by 
v) and neither the facets of v + cone(P) nor the translated triangulation 
hyperplanes contain any lattice points. This implies that every lattice point 
in v + cone(P) belongs to exactly one simplicial cone v + K;: 


cone(P) N Z**1 = (v + cone(P)) Zt! = U ((V+Kj)NZ*), (3.5) 
j=l 
and this union is a disjoint union. If we translate the last identity into 
generating-function language, it becomes 


1 Other names for h(z) used in the literature are 6-vector/polynomial and Ehrhart 
h-vector/polynomial of P. 
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m 
Tcone(P) (21, ZQye 0g Zd+1) = .S Ov+K; (z1, 22,004 Zd+1) : 
j=1 


But now we recall that the Ehrhart series of P is just a special evaluation of 
Ocone(p) (Lemma 3.10): 


Bhte(2) = Genet) els cle) = oye,  Ljong les (3.6) 
j=l 
It suffices to show that the rational generating functions oy +x, (1,1,.--,1, 2) 


for the simplicial cones v + K; have nonnegative integer numerator. But 
this fact follows from evaluating the rational function in Corollary 3.6 at 
(1,15. 845.142): 


This proof shows a little more: Since the origin is in precisely one simplicial 
cone on the right-hand side of (3.5), we get on the right-hand side of (3.6) 
precisely one term that contributes 1/(1 — z)¢+1 to Ehrp; all other terms 
contribute to higher powers of the numerator polynomial of Ehrp. That is, 
the constant term h§ equals 1. The reader might feel that we are chasing our 
tail at this point, since we assumed from the very beginning that the constant 
term of the infinite series Ehrp is 1, and hence hg has to be 1, as a quick 
look at the expansion of the rational function representing Ehrp shows. The 
above argument shows merely that this convention is geometrically sound. 
Let’s record this: 


Lemma 3.13. Suppose P is an integral convex d-polytope with Ehrhart series 


hit the_, 2b tee tha 
Ehrp(z) = a a us : 


Then hg = 1. 


For a general integral polytope P, the reader has probably already discov- 
ered how to extract the Ehrhart polynomial of P from its Ehrhart series: 


Lemma 3.14. Suppose P is an integral convex d-polytope with Ehrhart series 
hezt+ht_,2tt+-.-+hiztl 
= _ fd d-1 1 
Ehrp(z) =1+ ) Lp(t) 2° = aus ; 


t>1 


t+d ,f(e+d=1 ~ Heel ft 


Proof. Expand into a binomial series: 
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hi zt the_, 201 4+---+hiz+l 
G— 2a 


t+d 
athe th oe ta th ei ( Je" 
(ha d-1 1 )d d 


d t+d 
“ii (Shem oma (Speen 


t>0 t>0 


t+d t+d 
+a ( d Jer +¥ d )s 


t>0 t>0 


=i @be x (tee 


t>d t>d-1 


+n)? (iuere» eee 


t>1 t>0 


Ehrp(z) = 


In all infinite sums on the right-hand side, we can begin the index t with 0 
without changing the sums, by the definition (2.1) of the binomial coefficient. 
Hence 


The representation of the polynomial Lp(t) in terms of the coefficients of 
Ehrp can be interpreted as the Ehrhart polynomial expressed in the basis 
er ear fester, oe (see Exercise 3.14). This representation is very useful, as 
the following results show. 


Corollary 3.15. If P is an integral convex d-polytope, then the constant term 
of the Ehrhart polynomial Lp is 1. 


Proof. Use the expansion of Lemma 3.14. The constant term is 


Lp(0) = (‘) +15 (“7") + thia(1) +13(4) - (‘) =1. 


This proof is exciting, because it marks the first instance in which we extend 
the domain of an Ehrhart polynomial beyond the positive integers, for which 
the lattice-point enumerator was initially defined. More precisely, Ehrhart’s 
theorem (Theorem 3.8) implies that the counting function 


Lp(t)=# (tP NZ‘), 
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originally defined for positive integers t, can be extended to all real or even 
complex arguments ¢ (as a polynomial). A natural question arises: are there 
nice interpretations of Lp(t) for arguments t that are not positive integers? 
Corollary 3.15 gives such an interpretation for t = 0. In Chapter 4, we will 
give interpretations of Lp(t) for negative integers t. 


Corollary 3.16. Suppose P is an integral convex d-polytope with Ehrhart 


series 
Axztt+ht_ 2b 14---+htz+1 


G—2e1 


Then hi = Lp(1) -d-1=#(PNZ4) —d-1. 


Ehrp(z) = 


Proof. Use the expansion of Lemma 3.14 with t = 1: 


d+1 d 2 1 
p(t) = ( 2 ) +ni(G) +e nia(G) +na(f) = aba ay. 


The proof of Corollary 3.16 suggests that there are also formulas for 
h3,hs,... in terms of the evaluations Lp(1), Lp(2),..., and we invite the 
reader to find them (Exercise 3.15). 

A final corollary to Theorem 3.12 and Lemma 3.14 states how large the 
denominators of the Ehrhart coefficients can be: 


Corollary 3.17. Suppose P is an integral polytope with Ehrhart polynomial 
Lp(t) = cat? +ceq_1t* 1 +---+e,t+1. Then all coefficients satisfy d! cy, € Z. 


Proof. By Lemma 3.14, 


itd fied . tert aft 
srt (MEA) tm) tema) +8) 


where the hj are integers by Corollary 3.11 and our proof of Theorem 3.12. 
Hence multiplying out this expression yields a polynomial in t whose coeffi- 
cients can be written as rational numbers with denominator d!. 


We finish this section with a general result that gives relations between 
negative integer roots of a polynomial and its generating function. This 
theorem will become handy in Chapter 4, in which we find an interpretation 
for the evaluation of an Ehrhart polynomial at negative integers. 


Theorem 3.18. Suppose p is a degree-d polynomial with the rational gener- 
ating function 


» haze the12zt1+---+hizt+ho 
dort) z ~ (1—z)@1 


t>0 


Then ha = ha-, =++: = hey, = 0 and hy £0 if and only if p(—1) = p(—2) = 
-=p(—(d—k)) =0 and p(—(d—k+1)) £0. 
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Proof. Suppose hg = hg-1 = ++: = heyy = 0 and hy #0. Then the proof of 
Lemma 3.14 gives 


t+d t+d—k+1 t+d—k 
v(t) = ho? Jetia(’* * Jam (* ). 


d d d 


All the binomial coefficients are zero for t = —1,—-2,...,-d+k, so those are 
roots of p. On the other hand, all binomial coefficients but the last one are 
zero for t = —d+k—1, and since h, 40, —d+k —1 is not a root of p. 

Conversely, suppose p(—1) = p(—2) = --- = p(—(d—k)) = 0 and 
p(—(d—k+1)) £0. The first root —1 of p gives 


0 = p(-1) = ho (“ ') +hy + *) 4esthg-1 (‘) +ha () =ha @) 


so we must have hg = 0. The next root —2 forces hg_; = 0, and so on, up to 
the root —d+k, which forces hy41 = 0. It remains to show that hz, 4 0. But 
if hy, were zero, then by a similar line of reasoning as in the first part of the 
proof, we would have p(—d +k — 1) = 0, a contradiction. 


3.6 From the Discrete to the Continuous Volume of a 
Polytope 


Given a geometric object S C R¢, its volume, defined by the integral vol S' := 
i. g @X, is one of the fundamental data of S. By the definition of the integral, say 
in the Riemannian sense, we can think of computing vol S by approximating 
S with d-dimensional boxes that get smaller and smaller. To be precise, if we 
take the boxes with side length +, then they each have volume a. We might 
further think of the boxes as filling out the space between grid points in the 


lattice (2Z)*. This means that volume computation can be approximated by 


: : : ee d 
counting boxes, or equivalently, lattice points in (4Z) : 


1S = li : #( S59 17 , 


It is a short step to counting integer points in dilates of S, because 


# (se (32)') =H (69 12"). 


Let’s summarize: 
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Lemma 3.19. Suppose S C R® is d-dimensional. Then 


a # (18024). 


vol S = lim 
t-0o 


We emphasize here that S' is d-dimensional, because otherwise (since S' could 
be lower-dimensional although living in d-space), by our current definition, 
vol S = 0. We will extend our volume definition in Chapter 5 to give nonzero 
relative volume to objects that are not full-dimensional. 

Part of the magic of Ehrhart’s theorem lies in the fact that for an integral 
d-polytope P, we do not have to take a limit to compute vol P; we need to 
compute “only” the d+ 1 coefficients of a polynomial. 


Corollary 3.20. Suppose P C R® is an integral convex d-polytope with 
Ehrhart polynomial cqgt® + cq_1 4-1 +--+» +eqt+1. Then cg = volP. 


Proof. By Lemma 3.19, 


t@ a FT ee t+1 
volP = Jim Cq + Cq 1 + + Cy + - 


Ca. 
—0o ta i 


On the one hand, this should not come as a surprise, because the number 
of integer points in some object should grow roughly like the volume of the 
object as we make it bigger and bigger. On the other hand, the fact that we 
can compute the volume as one term of a polynomial should be very surprising: 
the polynomial is a counting function and as such is something discrete, yet 
by computing it (and its leading term), we derive some continuous data. Even 
more, we can—at least theoretically—compute this continuous datum (the 
volume) of the object by calculating a few values of the polynomial and then 
interpolating; this can be described as a completely discrete operation! 

We finish this section by showing how to retrieve the continuous volume of 
an integral polytope from its Ehrhart series. 


Corollary 3.21. Suppose P C R?@ is an integral convex d-polytope, and 


Axzttht_ 2b 14+.--+htz+1 


Ehrp(z) = (aan 


1 * * * 
Then vol P = = (hi the_, +--+: tht +1). 


Proof. Use the expansion of Lemma 3.14. The leading coefficient is 
1 
d! 


(hg tho yt-:-+hp +1). 
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3.7 Interpolation 


We now use the polynomial behavior of the discrete volume Lp of an integral 
polytope P to compute the continuous volume vol? and the discrete volume 
Lp from finite data. 

Two points uniquely determine a line. There exists a unique quadratic whose 
graph passes through any three given noncollinear points. More generally, a 
degree-d polynomial p is determined by d+ 1 points (2, p(a)) € R? in general 
position. Namely, evaluating p(x) = car? + cq_1x7"! +--+ + ¢9 at distinct 
inputs %1,%2,...,Ua+41 gives 


p (x1) Ca 
p (a2) Cd-1 
: =V ; (3.7) 
P (La+1) co 
where 
af nf x, 1 
ee wd at t 1 | 
G44 oe “++ Gay1 1 
so that 
Ca p (x1) 
Cqd-1 p (x2) 
=v! . : (3.8) 
Co p (&a41) 


(Exercise 3.21 makes sure that V is invertible.) The identity (3.8) gives the 
famous Lagrange interpolation formula. 

This gives us an efficient way to compute Lp, at least when dim? is not 
too large. The continuous volume of P will follow instantly, since it is the 
leading coefficient cq of Dp. In the case of an Ehrhart polynomial Dp, we 
know that Dp(0) = 1, so that (3.7) simplifies to 


Lp (a1) —-1 we oh? os ay Ca 
Lp (a2) -1 ae ai} re Cd-1 
Lp (xa) = re ae “os Dq Cy 


Example 3.22 (Reeve’s tetrahedron). Let 7), be the tetrahedron with 
vertices (0,0,0), (1,0,0), (0,1,0), and (1,1,h), where h is a positive integer 
(see Figure 3.7). 
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Fig. 3.7 Reeve’s tetrahedron 7), (and 27),). 


To interpolate the Ehrhart polynomial D7, (t) from its values at various 
points, we use Figure 3.7 to deduce the following: 


4 = Ly, (1) =vol(T,h) +e+a4+1, 
h+9= L7,(2) = vol (Th) - 2° +c2-2?+e,-24+1. 


Using the volume formula for a pyramid, we know that 


vol (T;,) = =(base area) (height) = " : 


Clr 


Thus h+ 1 = h+ 2cg — 1, which gives us cg = 1 and cy = 2 — A Therefore, 


h h 
In(t)= FO +2 + (2-2) 041. oO 
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3.8 Rational Polytopes and Ehrhart Quasipolynomials 


We do not have to change much to study lattice-point enumeration for rational 
polytopes, and most of this section will consist of exercises for the reader. The 
structural result paralleling Theorem 3.8 is as follows. 


Theorem 3.23 (Ehrhart’s theorem for rational polytopes). If P is a 
rational convex d-polytope, then L-p(t) is a quasipolynomial in t of degree d. Its 
period divides the least common multiple of the denominators of the coordinates 
of the vertices of P. 


We will call the least common multiple of the denominators of the co- 
ordinates of the vertices of P the denominator of P. Theorem 3.23, also 
due to Ehrhart, extends Theorem 3.8, because the denominator of an inte- 
gral polytope P is 1. Exercises 3.26 and 3.27 show that the word divides in 
Theorem 3.23 is far from being replaceable by equals. 

We set out along the path toward a proof of Theorem 3.23 by stating the 
analogue of Lemma 3.9 for quasipolynomials (see Exercise 3.24): 


Lemma 3.24. If 


t_ g(z) 
dU FOe =F 


t>0 


then f is a quasipolynomial of degree d with period dividing p if and only if g 
and h are polynomials such that deg(g) < deg(h), all roots of h are p' roots 
of unity of multiplicity at most d+ 1, and there is a root of multiplicity equal 
tod+1 (all of this assuming that ? has been reduced to lowest terms). 


Our goal is now evident: we will prove that if P is a rational convex 
d-polytope with denominator p, then 


Ehrp(z) =1 ‘ee gs yt 


for some polynomial g of degree less than p(d+ 1). As in Section 3.4, we will 
have to prove this only for the case of a rational simplex. So suppose the 
d-simplex A has vertices v1, V2,..-,Va+1 € Q%, and the denominator of A is 
p. Again we will cone over A: let 


Wi = (vi, 1), Wa = (va, 1),..., Waya = (Vai, 1); 
then 
cone(A) = {Ai wi + A2W2 Sie Aat1Wad41 7 1, ro, sans Nd+1 = 0} Cc Re 


To be able to use Theorem 3.5, we first have to ensure that we have a 
description of cone(A) with integral generators. But since the denominator of 
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A is p, we can replace each generator w, by pw; € Z7*!, and we are ready 
to apply Theorem 3.5. From this point, the proof of Theorem 3.23 proceeds 
exactly like that of Theorem 3.8, and we invite the reader to finish it up 
(Exercise 3.25). 

Although the proofs of Theorem 3.23 and Theorem 3.8 are almost identical, 
the arithmetic structure of Ehrhart quasipolynomials is much more subtle 
and less well known than that of Ehrhart polynomials. 


3.9 Reflections on the Coin-Exchange Problem and the 
Gallery of Chapter 2 


At this point, we encourage the reader to look back at the first two chapters 
in light of the basic Ehrhart-theory results. Theorem 1.5 and its higher- 
dimensional analogue give a special set of Ehrhart quasipolynomials. On the 
other hand, in Chapter 2 we encountered many integral polytopes. Ehrhart’s 
theorem (Theorem 3.8) explains why their lattice-point enumeration functions 
were all polynomials. 


Notes 


1. Triangulations of polytopes and manifolds are an active source of research 
with many interesting open problems; for further study, we highly recom- 
mend [98]. 


2. Eugene Ehrhart laid the foundation for the central theme of this book in 
the 1960s, starting with the proof of Theorem 3.8 in 1962 [109]. The proof 
we give here follows Ehrhart’s original lines of thought. An interesting fact 
is that he did his most beautiful work as a teacher at a lycée in Strasbourg 
(France), receiving his doctorate at age 60 on the urging of some colleagues. 


3. Given any d linearly independent vectors w1,w2,...,wq € R%, the lat- 
tice generated by them is the set of all integer linear combinations of 
W1,W2,...,Wg. Alternatively, one can define a lattice as a discrete subgroup 
of R?¢, and these two notions can be shown to be equivalent. One might wonder 
whether replacing the lattice Z? by an arbitrary lattice L throughout the 
statements of the theorems—requiring now that the vertices of a polytope be 
in £—gives us any different results. The fact that the theorems of this chapter 
remain the same follows from the observation that every d-dimensional lattice 
can be mapped to Z% by an invertible linear transformation. 
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4. Richard Stanley developed much of the theory of Ehrhart (quasi)polyno- 
mials, initially from a commutative-algebra point of view. Theorem 3.12 is 
due to him [226]. The proof we give here appeared in [46]. We will give several 
extensions of Theorem 3.12 in Chapter 10. 


5. The tetrahedron 7}, of Example 3.22 was used by John Reeve to show that 
Pick’s theorem does not hold in R® (see Exercise 3.23) [202]. Incidentally, the 
formula for Lz, also proves that the coefficients of an Ehrhart polynomial are 
not always positive. 


6. There are several interesting questions (some of which are still open) 
regarding the periods of Ehrhart quasipolynomials. Some particularly nice 
examples about what can happen with periods were given by Tyrrell McAllister 
and Kevin Woods [169]. 


7. Most of the results remain true if we replace convex polytope by polytopal 
complex, which is a finite union of polytopes. One important exception is 
Corollary 3.15: the constant term of an “Ehrhart polynomial” of an integral 
polytopal complex C is the Euler characteristic of C. 


8. The reader might wonder why we do not discuss polytopes with irrational 
vertices. The answer is simple: nobody has yet found a theory that would 
parallel the results in this chapter, even in dimension 2. One notable exception 
is [19], in which irrational extensions of Brion’s theorem are given; we will 
study the rational case of Brion’s theorem in Chapter 11. On the other hand, 
there has been recent activity to study Ehrhart quasipolynomials of rational 
polytopes with real dilation parameters [14,164]. Ehrhart theory has been 
extended to functions other than strict lattice-point counting; one instance is 
described in Chapter 13. 


Exercises 


3.1. Show that the number of triangulations of a polygon P that uses only 


the n vertices of P equals +, (?”').? 


3.2. To each permutation 7 € Sz on d elements, we associate the simplex 
A, := conv {0,€4(1),€n(1) + On(2), +++ En(1) + en(2) $°°° + en(ay}, 


where e;,€,...,@q denote the unit vectors in R?. 


(a) Prove that {A, : 7 € Sq} is a triangulation of the unit d-cube [0, 1]¢. 


2 The integer 1. ( 


WHI ) is known as the n** Catalan number. 
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(b) Prove that all A, are congruent to each other, that is, each one can be 


obtained from any other by reflections, translations, and rotations. 
(c) Show that for all 7 € Sg, La_(t) = ge 


3.3. & Given the polytope P C R®%, construct a lifted polytope Q as in (3.1). 


(a) Let F be a lower face of Q, defined by the supporting hyperplane 
H = {xe€R': a-x=b}. Show that aay: # 0. (This means that 
the hyperplane is not horizontal.) 

(b) Let F be a face of Q, defined by the supporting hyperplane H = 
{x € R41: a-x =b} such that QC {x ER“: a-x > bd}. Prove that 
F is a lower face if and only if ag41 > 0. 

(c) Let F; and Fz be lower faces of Q. Prove that 1(F ,)M 1(F2) is a face 
common to both m(F,) and a(F2). 


3.4. Prove that every triangulation of a polygon P that uses only the vertices 
of P is regular. Give an example of a (3-dimensional) polytope that has a 
nonregular triangulation. (Hint: Begin by proving that the triangulation of 
the 2-dimensional point configuration pictured in Figure 3.8 is not regular.*) 


Fig. 3.8 A nonregular 
triangulation of a point 
configuration. 


3.5. & Suppose T is a triangulation of a pointed cone. Prove that the inter- 
section of two simplicial cones in T is again a simplicial cone. 


3.6. Find the generating function ox(z) for the following cones: 


(a) K = {\1 (0,1) + A2(1, 0) : A1,A2 > 0}; 
(b) kK = {A1(0, 1) + A2(1, 1) 4 M1, A2 > 0}; 
(c) K= {(3, 4) + 1(0, 1) + d2(2, 1) 7 Az, A2 > 0}. 


3.7. Fix two relatively prime positive integers a and b, and let 
K = {Ai (0,1) + Az(a, b) : A1,A2 = OF. 

Show that _ 

a-— . | 2 {+l 

a + ent kek 


(1 — 2) (1 — z¢28) 


3A triangulation of a point configuration {x1,x2,...,Xn} is a triangulation of 
conv {x1,X2,...,Xn} using x1,X2,...,Xn as vertices. 


oK (1, 22) = 
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3.8. &@ Let S C R™ and T C R”. Show that ogy (21, 22,---;2mtn) = 
Og (21, 22)+-+ 52m) OF (S415 2m42)+++1%m4n)« 

3.9. & Let K be a rational d-cone, and let m € Z?. Show that om+x(z) = 
Z™oK(Z). 


3.10. & For a set SC R¢, let —S := {-x: x € S}. Prove that 
1. 1 
o_g (21, 22,---,24) =o8|—,—,.--,—]. 


3.11. Given a pointed cone K C R¢@ with apex at the origin, let S := KN Z?. 
Show that if x,y € S, then x+y © S. (In algebraic terms, S is a semigroup, 
since 0 € S and associativity of the addition in S follows trivially from 
associativity in R7.) 


3.12. Suppose K C R?@ is a pointed cone with apex v, H is a supporting 
hyperplane of K with HK = {v}, and w € R? is such that v + w € K. 
Show that (H + w) MK is a convex polytope. 


3.13. & Prove Lemma 3.9: If 


fz = 


t>0 


then f is a polynomial of degree d if and only if g is a polynomial of degree 
at most d and g(1) 4 0. 


3.14. Prove that ngs (ee so ...,(*) is a basis for the vector space Pol, 
of polynomials (in the variable x) of degree less than or equal to n. 


3.15. For a polynomial p(t) = cat? + cq_it? 1 +--+-+ co, let Hp(z) be defined 


by 
ols 
dnl) z — ye 
t>0 


Consider the map ¢q: Polg + Polg given by p+> Hp. 


(a) Show that ¢q is a linear transformation. 
(b) Compute the matrix describing ¢q for d= 0,1,2,.... 
(c) Deduce formulas for h3,h3,..., similar to the one in Corollary 3.16. 


3.16. Compute the Ehrhart polynomials and the Ehrhart series of the sim- 
plices with the following vertices: 


(a) (0, 0, 0), (1,0; 0), (0, 2,0), and (0, 0, 3); 
(b) (0, 0, 0, 0), (150, 0, 0), (0, 2, 0,0), (0, 0, 2,0); and (0, 0, 0, 4). 
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3.17. Define the hypersimplex A(d,k) as the convex hull of 
{ei ey ee ee : 1<j1 <ja<s- < jr <d}, 


where e1,€2,...,€q are the standard basis vectors in R@. For example, A(d, 1) 
and A(d,d—1) are regular (d—1)-simplices. Compute the Ehrhart polynomial 
and the Ehrhart series of A(d, k). 


3.18. & Suppose H is the hyperplane given by 
H={xeR’: a, 21 + Agr +--+ +agxq = 0} 


for some a1, 4@2,...,aq € Z, which we may assume to have no common factor. 
Prove that there exists v € Z? such that U,,<z ((nv + H) NZ*) = Z*. (This 
implies, in particular, that the points in Z? \ H are all at least some minimal 
distance from H; this minimal distance is essentially given by the dot product 
of v with (a1, @2,...,da)-) 


3.19. & A hyperplane H is rational if it can be written in the form 
H= {x eER*: a2, + agra +--+ +agrq = b} 


for some a1,a2,...,aq,b € Z. A hyperplane arrangement in R? is a finite 
set of hyperplanes in R¢. Prove that a rational hyperplane arrangement H. 
can be translated so that no hyperplane contains any integer points. 


3.20. The conclusion of the previous exercise can be strengthened: Prove 
that a rational hyperplane arrangement 1 can be translated such that no 
hyperplane contains any rational points. 


3.21. & Show that 


a at} nr he | 
xe nt ss ag 1 
det . . . = II (x; — Zp). 
: cs : 1<j<k<d+1 
d — 
Tay Tayi °° Tati t 
Conclude that for distinct numbers x1, %2,...,%q41, the matrix 
af ot a Pa] 
weg at} ‘rs Do 
V = 
d d-1 
Tayi Tayi *** Tati 1 


is not singular. (V is known as the Vandermonde matrix.) 


3.22. Let P be an integral d-polytope. Show that 
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d 
vol P = q (0 +50 @ -H*En()) : 


3.23. As in Example 3.22, let 7;, be the tetrahedron with vertices (0, 0,0), 
(1,0,0), (0,1,0), and (1,1,7), where n is a positive integer. Show that the 
volume of 7;, is unbounded as n —> oo, yet for all n, 7, has no interior 
and precisely four boundary lattice points. This example proves that Pick’s 
theorem does not hold for a 3-dimensional integral polytope P, in the sense 
that there is no linear relationship among vol P, Lp(1), and Lpe(1). 


3.24. & Prove Lemma 3.24: If 


t g(2) 
dF) = Fe 


t>0 


then f is a quasipolynomial of degree d with period dividing p if and only if g 
and h are polynomials such that deg(g) < deg(h), all roots of h are p*® roots 
of unity of multiplicity at most d+1, and there is a root of multiplicity equal 
to d+1 (all of this assuming that # has been reduced to lowest terms). 


3.25. & Provide the details for the proof of Theorem 3.23: If P is a rational 
convex d-polytope, then Lp(t) is a quasipolynomial in t of degree d. Its period 
divides the least common multiple of the denominators of the coordinates of 
the vertices of P. 


3.26. Let 7 be the rational triangle with vertices (0,0), (1, v1), and (p,0), 


where p is a fixed integer > 2. Show that L7(t) = st et pete +1; in 
particular, D7 is a polynomial. 


3.27. Prove that for every d > 2 and p > 1, there exists a d-polytope P whose 
Ehrhart quasipolynomial is a polynomial (i.e., it has period 1), yet P has a 
vertex with denominator p. 


3.28. Prove that the period of the Ehrhart quasipolynomial of a 1-dimensional 
polytope is always equal to the least common multiple of the denominators 
of its vertices. 


3.29. Let 7 be the triangle with vertices (-5, —3), (5, —), and (0, 3). Show 
that L(t) =t? +c(t)t+1, where 


1 if t is even, 
c(t) = i 
0 if t is odd. 


(This shows that the periods of the “coefficients” of an Ehrhart quasipolyno- 
mial do not necessarily increase with decreasing power.) Find the Ehrhart 
series of T. 
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3.30. Prove the following extension of Theorem 3.12: Suppose P is a rational 
d-polytope with denominator p. Then 
h*(z) 


Ehrp(z) = Gan) 


where h*(z) is a polynomial with nonnegative integral coefficients. 


3.31. Find and prove a statement that extends Lemma 3.14 to Ehrhart 
quasipolynomials. 


3.32. d& Prove the following extension of Corollary 3.15 to rational polytopes. 
Namely, the Ehrhart quasipolynomial Lp of the rational convex polytope 
P CR? satisfies Lp(0) = 1. 


3.33. Prove the following analogue of Corollary 3.17 for rational polytopes: 
Suppose P is a rational polytope with Ehrhart quasipolynomial Lp(t) = 
ca(t) t4 + cq_1(t) t2-1 +++ + e1(t) t+ co(t). Then for allt € Z and0<k<d, 
we have d! c,(t) € Z. 


3.34. & Prove that Corollary 3.20 also holds for rational polytopes: Suppose 
P CR? is a rational convex d-polytope with Ehrhart quasipolynomial ¢q(t) t¢+ 
ca—-1(t) t¢-1! +--+ + e9(t). Then ca(t) equals the volume of P; in particular, 
ca(t) is constant. 


3.35. Suppose P is a rational convex polytope. Show that as rational functions, 


Ehrop(z) = $ (Ehrp (Vz) + Ehrp (-Vz)) : 
3.36. Suppose f and g are quasipolynomials. Prove that the convolution 


t 


F(t) = D0 f(s) g(t 8) 


s=0 


is also a quasipolynomial. What can you say about the degree and the period 
of F’, given the degrees and periods of f and g? 


3.37. Given two positive integers a and }, let 


no {t if alt, ana a= {1 if lt, 


0 otherwise, 0 otherwise. 


Form the convolution of f and g. What function is it? 


3.38. Suppose P C R™” and Q C R” are rational polytopes. Prove that the 
convolution of Lp and Lg equals the Ehrhart quasipolynomial of the polytope 
given by the convex hull of P x {0,} x {0} and {0,,} x Q x {1}. Here 0, 
denotes the origin in R?. 


838 3 Counting Lattice Points in Polytopes: The Ehrhart Theory 


3.39. We define the unimodular group SLz(Z) as the set of all d x d 
matrices with integer entries and determinant +1. 


(a) Show that each element of SLy(Z) acts on the integer lattice Z? in a 
bijective fashion. 

(b) Let P be an integral polytope, and let OQ := A(P), where A € SLa(Z), 
so that P and Q are unimodular images of each other. Show that Dp(t) = 


Lo(t). 


3.40. Search on the Internet for the program LattE: Lattice-Point Enumera- 
tion [94,154]. You can download it for free. Experiment. 


Open Problems 


3.41. How many triangulations are there for a given polytope? 


3.42. What is the minimal number of simplices needed to triangulate the unit 
d-cube? (These numbers are known for d < 7.) 


3.43. Classify the polynomials of a fixed degree d that are Ehrhart polynomials. 
This has been done completely for d = 2 [214] and is partially known for d = 3 
and 4 [36, Section 3]. (See also Open Problem 10.21.) 


3.44. Study the roots of Ehrhart polynomials of integral polytopes in a fixed 
dimension [36,55,62,131]. Study the roots of the numerators of Ehrhart series. 


3.45. Come up with an efficient algorithm that computes the period of an 
Ehrhart quasipolynomial. (See [253], in which Woods describes an efficient 
algorithm that checks whether a given integer is a period of an Ehrhart 
quasipolynomial.) 


3.46. Suppose P and Q are integral polytopes with the same Ehrhart poly- 
nomial, that is, Dp(t) = Lg(t). What additional conditions on P and Q do 
we need to ensure that integer translates of P and Q are unimodular images 
of each other? That is, when is Q = A(P) +m for some A € SLq(Z) and 
m € Z4? 


3.47. Find an “Ehrhart theory” for irrational polytopes. 


Chapter 4 
Reciprocity 


In mathematics you don’t understand things. You just get used to them. 


John von Neumann (1903-1957) 


While Exercise 1.4(i) gave us the elementary identity 


el] o 


for t € Z and a € Zyo, this fact is a special instance of a more general theme. 
Namely, (4.1) marks the simplest (1-dimensional) case of a reciprocity theorem 
that is central to Ehrhart theory. Let Z := [0, +] CR, a rational 1-polytope 


Fig. 4.1 Lattice points in tZ. 


(see Figure 4.1). Its discrete volume is (recalling Exercise 1.3) 


t 


Lz(t) = H dei, 


a 


The lattice-point enumerator for the interior T° = (0, +), on the other hand, 
is 


Lzo(t) = = | (4.2) 


a 
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(see Exercise 4.1). We remind the reader that both Lz(t) and L[z.(t), by 
definition, are functions in the positive-integer variable t. However, we can 
see in the above formulas that they are given by quasipolynomials, and (4.1) 
gives an algebraic relation between these two quasipolynomials: 


Lat)=—=iC). 


This chapter is devoted to proving that a similar identity holds for rational 
polytopes in any dimension: 


Theorem 4.1 (Ehrhart—Macdonald reciprocity). Suppose P is a conver 
rational polytope. Then the evaluation of the quasipolynomial Lp at negative 
integers yields 

Lp(=t) = (—1)9™"? Leet). 


This theorem belongs to a class of famous reciprocity theorems. A common 
theme in combinatorics is to begin with an interesting object P, and 


1. define a counting function f(t) attached to P that makes physical sense 

for positive integer values of ¢; 

recognize the function f as a polynomial in ¢; 

3. substitute negative integral values of t into the counting function f, and 
recognize f(—t) as a counting function of a new mathematical object Q. 


~ 


For us, P is a polytope, and Q is its interior. 


4.1 Generating Functions for Somewhat Irrational 
Cones 


Our approach to proving Theorem 4.1 parallels the steps of Chapter 3: we 
deduce Theorem 4.1 from an identity for rational cones. We begin with a 
reciprocity theorem for simplicial cones. 


Theorem 4.2. Fix linearly independent vectors w1,w2,...,wq € Z%, and 
let K = {\ywi + Agwe +--+ + Aqwa: A1,---,Ad > O}, the simplicial cone 
generated by the w;. Then for those v € R¢ for which the boundary of the 
shifted simplicial cone v + K contains no integer point, 


1 1 1 
ore ( ee y= 1)? o_yix (21, 82)..-+ 2a) 
21 22 2d 


Remark. This theorem is meaningless on the level of formal power series; 
however, the identity holds at the level of rational functions. We will establish 
that oy; is a rational function in the process of proving the theorem. 


Proof. As in the proofs of Theorem 3.5 and Corollary 3.6, 
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0. K(z) = Ov+t(Z) 
v+ (1 —2™1) (1 —2¥2)---(1—2a)’ 


where II is the open parallelepiped 
TI = {Aywi + Aowe +--+ + Agwa: 0 < Aj, A2,.--,Ad < 1}. (4.3) 


This also proves that oy, is a rational function. Note that by assumption, 
v +I contains no integer points on its boundary. Naturally, 


o-v+m(Z) 
1—2™1)(1—22)---(1—2a)’ 


o-v+K (Z) = ( 


so we need to relate the parallelepipeds v + II and —v + II. This relation is 
illustrated in Figure 4.2 for the case d = 2; the identity for general d is (see 
Exercise 4.2) 


v+I=—(-v+ID + wi +wo+-:-+Wwa. (4.4) 


Now we translate the geometry of (4.4) into generating functions: 


W2 


v+Il 


—-v+I —(-v4+T]) —(-v +I) + wi + we 


Fig. 4.2 From —v+II tov+II. 
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Ov+n(Z) = O-(—v4r(Z) 2712?» 24 


-_ 1 1 i W1,W2 Wad 
=o-vin(2,4,..,£)2 Z aceay A 


(the last equation follows from Exercise 3.10). Let’s abbreviate the vector 


1 1 1 1 . . . . 
(2. Bees 1) by 5. Then the last identity is equivalent to 
if —-W1i,—-W2 —Wd 
Ov-+Il =e lee o_v+m(Z) Zz Z a4 , 
whence 


1\ _ Ov+II (2) 
wan () ~ Ce =e ee 
o_v4m(z) Zz ~1Z 2... ZW 
(1 = Z—W1) (1 = Z—W2) sate (1 = ZW) 


7 o—v+(Z) 
(21 — 1) (2w2 — 1)---(aw¢ — 1) 
= ( i o_v+m(Z) 


(1 — 2™1) (1 — 2¥2)--- (1 — 22) 
= (-1)¢0-v4K(2). 


4.2 Stanley’s Reciprocity Theorem for Rational Cones 


For the general reciprocity theorem for cones, we patch the simplicial cones 
of a triangulation together, in a manner very similar to what we did in our 
proof of Theorem 3.12. 


Theorem 4.3 (Stanley reciprocity). Suppose K is a rational d-cone with 
the origin as apex. Then 


1 oil 1 
or ( ba y= 1)? axe (21, 2a, «65 2a) 


>] ? b 
21 22 2d 


Proof. Triangulate K into simplicial cones K1,K2,..., Km. Exercise 3.19 en- 
sures that there exists a vector v € R®@ such that the shifted cone v + K 
contains exactly the interior lattice points of K, 


KenZt =(v+K)nZ, (4.5) 
and there are no boundary lattice points on any of the triangulation cones: 


A(v+K;)NZt=o for all 7 = 1,...,m, (4.6) 
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as well as 
A(-—v+Kj)NZi =a for all j =1,...,m. (4.7) 
We invite the reader (Exercise 4.4) to realize that (4.5)—(4.7) imply 
KOZ =(-v+ K)nZ?. (4.8) 
Now by Theorem 4.2, 
ox (3) =o-vte (2) = o-vtK, (2) = DO(-1)* vt, (2) 
j=l j=l 


Note that the second and fourth equalities are true because of the validity 
of (4.7) and (4.6), respectively. 


4.3 Ehrhart—Macdonald Reciprocity for Rational 
Polytopes 


In preparation for the proof of Theorem 4.1, we define the Ehrhart series 
for the interior of the rational polytope P as 


Ehrpe(z) := py Lpo(t) 2". 


t>1 


Our convention of beginning the series with t = 1 stems from the fact that 
this generating function is a special evaluation of the integer-point transform 
of the open cone (cone(P))°: much in syne with Lemma 3.10, 


Ehrpo (z) = O(cone(P))° (1, 1, seg 1; Zz) . (4.9) 
We are now ready to prove the Ehrhart-series analogue of Theorem 4.1. 
Theorem 4.4. Suppose P is a convex rational polytope. Then the evaluation 


of the rational function Ehrp at 4 yields 


1 : 

Ehrp (2) = (-1)%" ?*" Ehrpo(z). 
z 

Proof. Suppose P is a d-polytope. We recall Lemma 3.10, which states that 

the generating function of the Ehrhart polynomial of P is an evaluation of 

the generating function of cone(P): 


Ehrp(z) =~ Lp(t) z* = ocone(p) (1) 1,.--,1,2). 
t>0 


94 4 Reciprocity 


Equation (4.9) gives the analogous evaluation of o(cone(p))? that yields Ehrpo. 
Now we apply Theorem 4.3 to the (d+ 1)-cone K = cone(P): 


A (cone(P))° (1, 1y...54, z) _ (—1)** Fcone(P) ale Ljees5d, +) : 


Theorem 4.1 now follows like a breeze. 


Proof of Ehrhart-Macdonald reciprocity (Theorem 4.1). We first apply Exer- 
cise 4.7 to the Ehrhart series of P: namely, as rational functions, 


1 
Ehrp (<) =) 0 Lp(-t) 2 =- 0 Lp (-t)2*. 
ss t<0 t>1 
Now we combine this identity with Theorem 4.4 to obtain 
1 
t_¢_ 4)d+1 >) _/7_4)d = t 

S © Lpe(t) 2' = (—1)"? Ehrp (<) = (-1)7 $0 Lp(-t) 2. 
t>1 t>1 


Comparing the coefficients of the two power series yields the reciprocity 
theorem. 


With Ehrhart—Macdonald reciprocity, we can now restate Theorem 3.18 in 
terms of Ehrhart polynomials: 


Theorem 4.5. Suppose P is an integral d-polytope with Ehrhart series 


htzt+ht_,2t14..-+htz41 
Ehrp(z) = : “ a ; 


Then hy = hg_, =--- = hi, =0 and hy, £0 if and only if (d—k+1)P is 
the smallest integer dilate of P that contains an interior lattice point. 


Proof. Theorem 3.18 says that hf is the highest nonzero coefficient if and only 
if Lp(—1) = Lp(—2) =--- = Lp (-(d—k)) =0 and Lp (—(d—k + 1)) £0. 
Now use Ehrhart—Macdonald reciprocity (Theorem 4.1). 


The largest k for which hf # 0 is called the degree of P. The above 
theorem says that the degree of P is k precisely if (d—k+1)P is the smallest 
integer dilate of P that contains an interior lattice point. 


4.4 The Ehrhart Series of Reflexive Polytopes 


As an application of Theorem 4.4, we now study a special class of integral 
polytopes whose Ehrhart series have an additional symmetry structure. We 
call a polytope P reflexive if it is integral and has the hyperplane description 
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P={xeR’: Ax<1}, 


where A is an integral matrix. (Here 1 denotes a vector all of whose coordinates 
are 1.) The following theorem gives a characterization of reflexive polytopes 
through their Ehrhart series. 


Theorem 4.6 (Hibi’s palindromic theorem). Suppose P is an integral 
d-polytope that contains the origin in its interior and that has the Ehrhart 


—— hit the_, 20 t+ tht zthé 
Ehrp(z) = (12a : 


Then P is reflexive if and only if hi, =hi_, for allO0<k< q 


The two main ingredients for the proof of this result are Theorem 4.4 and 
the following: 


Lemma 4.7. Suppose a1,@2,...,@a,b € Z satisfy gcd (a1, a2,...,@a,0) = 1 
and b > 1. Then there exist positive integers c andt such that tb <c< (t+1)b 
and {(m1,ma,...,Ma) €Z*: aym, +agmo+--- + agm¢g = c} AD. 


Proof. Let g = gcd (aj, a2,...,@a); by our assumption, gcd(g, b) = 1, so one 
can find integers k and ¢ such that 


kg —tb=1. (4.10) 


Furthermore, we can choose k and t in such a way that t > 0. Let c = kg; 
Equation (4.10) and the condition b > 1 imply that tb <c < (¢+1)b. Finally, 
since g = gcd (aj, d2,..., aa), there exist mj ,m2,...,ma € Z such that 


aymy + agmg+-+--+aqgmg=kg=c. 


Proof of Theorem 4.6. We recall that P is reflexive if and only if 
P={xeR*: Ax <1} for some integral matrix A. (4.11) 

We claim that P has such a hyperplane description if and only if 
(t+ 1)P° NZ? =tP NZ? for allt € Zso. (4.12) 


This condition means that the only lattice points that we gain in passing from 
tP to (t+ 1)P are those on the boundary of (t + 1)P. The fact that (4.11) 
implies (4.12) is the content of Exercise 4.13. Conversely, if P satisfies (4.12), 
then there are no lattice points between tH and (t+ 1)H for any facet 
hyperplane H of P (Exercise 4.14). That is, if a facet hyperplane is given 
by H = {x ER?¢: aya; taor. +--+ + agra = b}, where we may assume 
gcd (a1, d2,...,@a,b) = 1, then 


{x € Z*: tb < aja] + a9%2 +--+ +agxq < (t+ 1)b} =2@. 
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But then Lemma 4.7 implies that 6 = 1, and so P has a hyperplane description 
of the form (4.11). 

Thus we have established that P is reflexive if and only if it satisfies (4.12). 
Now by Theorem 4.4, 


hg ttt tht zt te. +ht_ 2 t+hiz 


Bhrpe (2) = (-1)"*! Ehrp (<) = Ga 


By condition (4.12), P is reflexive if and only if this rational function is equal 
to 


So Lplt -l1)2°= z>_ Lp(t) z* = zEhrp(z) 


t>1 t>0 
Ae zttt phe 2tte thier +hiz 
(1—z)41 , 


that is, if and only if hj; = h%_, for allO<k< q. 


4.5 More “Reflections” on the Coin-Exchange Problem 
and the Gallery of Chapter 2 


We have already encountered special cases of Ehrhart-Macdonald reciprocity 
several times. Note that Theorem 4.1 allows us to conclude that counting 
the number of interior lattice points in a rational polytope is tantamount to 
counting lattice points in its closure. Exercises 1.33, 2.1, and 2.7, as well as 
part (b) of each theorem in the gallery of Chapter 2, confirm that 


Lp(—t) = (-1)%™? Lpo(t). 


Notes 


1. Ehrhart—Macdonald reciprocity (Theorem 4.1) had been conjectured (and 
proved in several special cases) by Eugéne Ehrhart for about a decade before 
I. G. Macdonald found a general proof in 1971 [166]. One can actually relax the 
condition of Ehrhart-Macdonald reciprocity: it holds for polytopal complexes 
that are homeomorphic to a d-manifold. The proof we give here (including the 
proof of Theorem 4.3) appeared in [46]. An attractive alternative approach 
was given by Steven Sam [209]. 


2. Theorem 4.3 is due to Richard Stanley [225], who proved more general 
versions of this theorem. The reader might recall that the rational function 
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representing the Ehrhart series of a rational cone can be thought of as 
its meromorphic continuation. Stanley reciprocity (Theorem 4.3) gives a 
functional identity for such meromorphic continuations. 


3. The term reflexive polytope was coined by Victor Batyrev, who found 
exciting applications of these polytopes to mirror symmetry in physical string 
theory [25]. Batyrev proved that the toric variety Xp defined by a reflexive 
polytope P is Fano, and that every generic hypersurface of Xp is Calabi— 
Yau. That the Ehrhart series of a reflexive polytope exhibits an unexpected 
symmetry (Theorem 4.6) was discovered by Takayuki Hibi [135]. Matthew 
Fiset and Alexander Kasprzyk proved a version of Theorem 4.6 for rational 
polytopes [114]. The number of reflexive polytopes in dimension d is known 
for d < 4 [156,157]; for example, there are precisely 16 reflexive polytopes in 
dimension 2, up to symmetries (see also [1, Sequence A090045]). 


A. Reflexive polytopes bear some pleasant surprises beyond those mentioned in 
Section 4.4. For example, Benjamin Braun [61] proved that if P is reflexive and 
Q is an integral polytope that contains the origin in its interior, where P and Q 
live in orthogonal subspaces of R¢, then the free sum P © Q := conv (P U Q) 
has Ehrhart series 


Ehrpgo(z) = (1 — z) Ehrp(z) Ehro(z), (4.13) 


an analogue of sorts to Exercise 2.4; see Exercise 4.17. Another striking result 
is that the sum of the numbers of lattice points on the boundaries of a reflexive 
polygon and its dual is always 12 [137,194]. A similar result holds in dimension 
3 (with 12 replaced by 24) [26], but no elementary proof of the latter fact is 
known [34, Section 4]. 


5. There is an equivalent definition for reflexive polytopes: P is reflexive if and 
only if both P and its dual P* are integral polytopes. The dual polytope of 
P (often also called the polar polytope) is defined as 


P* = {xe R*: x-y <1 forallyeP}. 


The concept of (polar) duality is not confined to polytopes but can be defined 
for any nonempty subset of R?. Duality is a crucial chapter in the theory of 
polytopes, and one of its applications is the equivalence of the vertex and 
hyperplane description of a polytope. For more about (polar) duality, the 
reader might consult [20, Chapter IV]. 


6. The cross-polytopes © from Section 2.5 form a special class of reflexive 
polytopes. We mentioned in the notes of Chapter 2 that the roots of the 
Ehrhart polynomials Lo all have real part -4 (75, 148]. Christian Bey, Martin 
Henk, and Jérg Wills proved in [55] that if all complex roots of Lp(t), for 
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some integral polytope P, have real part —t, then P is the unimodular image 
of a reflexive polytope. 


7. An integral polytope P C R®@ for which there exists a positive integer k such 
that (k —1)P°N Z4 = @, #(kKPNZ4) = 1 and tP° N Zt = (t— k)P NZ? for 
all integers t > k is called Gorenstein of index k. Thus reflexive polytopes 
are Gorenstein of index 1. (Some of the polytopes in Chapters 2 and 6 are 
Gorenstein.) Theorem 4.6 can be extended to the statement that an integral 
polytope is Gorenstein if and only if its nonzero hi’s are symmetric (not 
necessarily with center of symmetry d as in the case of reflexive polytopes); 
see Exercise 4.8. 


Exercises 


fH 
4.1. & Prove (4.2): for a,t € Zo, Loa) (4) = | 


a 
4.2. & Explain (4.4): if w1,Wwe,...,wa € R@ are linearly independent and 


Tl = {Aywi + Agwe +--+ + Aawa: 0 < Aq, A2,---,Aq < 1}, 


then v + II = —(-v + TI) + w, + wo +--+ + wa. 


4.3. Revisit Exercise 2.15, but now give a one-line proof that the Bernoulli 
polynomials satisfy By(1 — 7) = (—1)7Ba(z). 


4.4. & Prove that (4.5)—(4.7) imply (4.8); that is, if K is a rational pointed 
d-cone with the origin as apex and v € R? is such that 


KenZ?=(vt+K)nZ?, 
Av+K;)NZ =o for all j =1,...,m, 


and 
O(—v+K;)NZ4 =o for all 7 = 1,...,m, 
then 
KnZ? = (-v+K)nZ?. 
4.5. Prove the following generalization of Theorem 4.3 to rational pointed 
cones with arbitrary apex: Suppose XK is a rational pointed d-cone with the 


origin as apex, and v € R?. Then the integer-point transform oy+4(z) of the 
pointed d-cone v + K is a rational function that satisfies 
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1 d 
Ovi+k = = (—1) O(-v+K)° (z) : 
4.6. Generalize Theorem 4.3 by showing that we do not need to assume that 
K is full dimensional. 


4.7. & Suppose Q: Z > C is a quasipolynomial. We know that RG(z) — 
so Q(t) 2’ evaluates to a rational function. 


(a) Prove that Ro(z) = i<9 Q(t) 2" also evaluates to a rational function. 

(b) Let Q(t) = 1. Prove that as rational functions, RG(z) + Ro(z) =0. 

(c) Suppose Q is a polynomial. Prove that as rational functions, RS(z) + 
Ro(z) = 0. 

(d) Suppose Q is a quasipolynomial. Prove that as rational functions, RG(z) + 
Ro(z) = 0. 

4.8. & Suppose that P is an integral d-polytope for which 

Lpo (t) = Lp(t _ k) and Lpe(1) = Lpo (2) See Lpo (k — a8) =0 


for some integer k (so P is a Gorenstein polytope, as mentioned in the notes). 
Prove that 


1 7 

Ehrp (<) = (—1)4*1z* Ehrp(z). 
z 

4.9. Suppose P is an integral d-polytope with Ehrhart series 


Axzt+ht_, 2b +e-+htz+l 
Ehrp(z) = a — z)41 : 


Prove that h* = Lpo(1). 


4.10. Suppose P is an integral d-polytope. Show that the dilate (d+ 1)P 
contains an interior lattice point. 


4.11. Suppose P is an integral polytope. Denote the boundary of P by OP. 
Prove that Lgp(t) is a polynomial that is either even or odd. Determine its 
constant term. 


4.12. Recall the restricted partition function 
Pieasts,nchighh) = # {(m1, se ,Md) € ZS 2 MQ, ++++ + Mga = n} 


from Chapter 1. Prove that as quasipolynomials, 


P{ai,a2 cgnayh nm— ay a2 ase ad) = (-1)9"" Deas as ee aa} (7) 


and that 
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Pfay,a2,..., éap(— 1) = Plai,a2,..., éapk—2) Si 
= 1 eee a aes —ag—+++—aaqt 1) =0. 


4.13. & Prove that (4.11) implies (4.12), that is, show that if the polytope P 
is given by P = {x eR?: Ax< 1} for an integral matrix A, then 


(E+1)P° NZ? =tPnZ 
for all t € Z5o. 


4.14. & Suppose P is an integral polytope that satisfies (4.12): P°NZ4 = {0} 
and for all t € Zyo, ((+1)P° NZ4 = tPN Z4. Then for every t € Z, there are 
no lattice points between tH and (t+ 1)H for any facet hyperplane H of P. 


4.15. Compute the dual polytope of the d-dimensional cross-polytope ©. 


4.16. Suppose P C R? is an integral d-polytope with 0 € P°. Show that its 
dual P* is integral if and only if P is of the form {< e€R?: Ax< 1} for 
some integral matrix A. 


4.17. Prove (4.13): if P is reflexive and Q is an integral polytope that contains 
the origin in its interior, where P and Q live in orthogonal subspaces of R¢, 
then the free sum P © Q := conv (P U Q) has Ehrhart series 


Ehrpeo(z) = (1 — z) Ehrp(z) Ehro(z). 


Open Problems 


4.18. Suppose P is a 3-dimensional reflexive polytope. Denote by e* the 
edge in the dual polytope P* that corresponds to the edge e in P. Give an 
elementary proof that 


ye length (e) - length (e*) = 24. 
e edge of P 


4.19. Find the number of reflexive polytopes in dimension d > 5. 


4.20. Prove Mahler’s conjecture: if K is a d-dimensional centrally symmetric 


convex set, then 
4a 
vol(K) vol(K*) > rik 
(The right-hand side is the quantity vol(K) vol(K*) for K = [—1,1]%. See [125, 
Chapter 9] for further background and references.) 


Chapter 5 


Face Numbers and the 
Dehn—-Sommerville Relations in 
Ehrhartian Terms 


“Data! Data! Data!” he cried, impatiently. “I can’t make bricks without clay.” 


Sherlock Holmes (The Adventure of the Copper Beeches, by Arthur Conan Doyle, 1859— 
1930) 


Our goal in this chapter is twofold, or rather, there is one goal in two different 
guises. The first is to prove a set of fascinating identities, which give linear 
relations among the face numbers of a polytope. They are called Dehn-— 
Sommerville relations, in honor of their discoverers Max Wilhelm Dehn 
(1878-1952)! and Duncan MacLaren Young Sommerville (1879-1934).? Our 
second goal is to unify the Dehn—Sommerville relations (Theorem 5.1 below) 
with Ehrhart—Macdonald reciprocity (Theorem 4.1). 


5.1 Face It! 


We denote the number of k-dimensional faces of P by the symbol f,. As k 
varies from 0 to d, the face numbers f;, encode intrinsic information about 
the polytope P. The d-polytope P is simple if each vertex of P lies on 
precisely d edges of P. 


Theorem 5.1 (Dehn—Sommerville relations). If P is a simple d-polytope 
and0Q<k<d, then 


1 For more information about Dehn, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Dehn. html. 


? For more information about Sommerville, see 
http: //www-history.mcs.st-andrews.ac.uk/Biographies/Sommerville.html. 
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This theorem takes on a particularly nice form for k = d, namely the 
famous Euler relation, which holds for every polytope (not just simple ones). 


Theorem 5.2 (Euler relation). [f P is a convex d-polytope, then 


d 
So(-1)' fj =1. 


j=0 


This identity is less trivial than it might look. We give a quick proof for 
rational polytopes, for which we can use Ehrhart—Macdonald reciprocity 
(Theorem 4.1). 


Proof of Theorem 5.2, assuming that P is rational. Let us count the integer 
points in tP according to the (relatively) open faces that contain them:? 


Lp(t) = So Lye(t) = So (-1)™F Lz (2). (5.1) 


FCP FCP 


Here and in the remainder of this chapter, the sums are over all nonempty faces. 
(Alternatively, we could agree that Lg(t) = 0.) Note that the first equality 
in (5.1), while innocent-looking, uses the not entirely trivial Exercise 5.3. The 
constant term of L(t) is 1 for every face F (by Exercise 3.32). Hence the 
constant terms of (5.1) give 


d 


1= ens = Rvs, 


FCP j=0 


which proves our claim. 


There is a natural structure on the faces of a polytope P induced by the 
containment relation F C G. This relation gives a partial ordering on the set 
of all faces of P, called the face lattice of P.* A useful way to illustrate this 
partially ordered set is through a directed graph whose nodes correspond to 
the faces of P, and we have an edge from F¥ to G if 


FCG and dimG = dim F +1. 


We draw this directed graph in such a way that the edge directions are upward; 
Figure 5.1 shows the face lattice for a triangle. Exercise 2.6 implies that the 
face lattice of every simplex is a Boolean lattice, which is the partially 
ordered set formed by all subsets of a finite set, where the partial ordering is 
again subset containment. 

We already mentioned that we will unify the Dehn—Sommerville relations 
(Theorem 5.1) with Ehrhart—Macdonald reciprocity (Theorem 4.1). It is for 


3 Note that the relative interior of a vertex is the vertex itself. 


4 The use of the word lattice here is disjoint from our previous definition of the word. 
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Fig. 5.1 The face lattice of a triangle. 


this reason that we will prove Theorem 5.1 only for rational polytopes. To 
combine the notions of face numbers and lattice-point enumeration, we define 


F(t) = Sl Lel(t), 
FOP 
dim F=k 
the sum being taken over all k-faces of P. By Ehrhart’s theorem (Theo- 
rem 3.23), F;, is a quasipolynomial. Since Lz(0) = 1 for all J, 


Fi,(0) = fr, 


the number of k-faces of P. We also remark that the leading coefficient of F;, 
measures the relative volume of the k-skeleton of P, that is, the union of all 
k-faces; see Section 5.4 for a precise definition of relative volume. 

Our common extension of Theorems 5.1 and 4.1 is the subject of the next 
section. 


5.2 Dehn—Sommerville Extended 


Theorem 5.3. If P is a simple rational d-polytope and0 <k <d, then 


The classical Dehn—Sommerville equations (Theorem 5.1)—again, only 
for rational polytopes—are obtained from the constant terms of the count- 
ing functions on both sides of the identity. On the other hand, for k = d, 
Theorem 5.3 gives (with t replaced by —t) 
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d 
Lp(—t) = Fa(-t) = ee) a1} G1 eG 


j=0 j=0 


The sum on the right-hand side is an inclusion—exclusion formula for the 
number of lattice points in the interior of ¢P (count all the points in ¢P, 
subtract the ones on the facets, add back what you have overcounted, etc.), 
so in a sense, we recover Ehrhart—Macdonald reciprocity. 


Proof. Suppose F is a k-face of P. Then again by counting the integer points 
in F according to relatively open faces of F (Exercise 5.3), 


= S5 Lge(t) 


GOF 


or by the Ehrhart—Macdonald reciprocity (Theorem 4.1), 


i 
Ly(t) = Do (-1) 8 Lg (-t) = S0(-1)" DS Le (-t). (6.2) 


GCF j=0 GOF 


Now sum both left- and right-hand sides over all k-faces and rearrange the 
sum on the right-hand side: 


k 
FeQ)= SY) SO-1? SO Le (-4) 
FCP j=0 GOF 
dim F=k dim G=j 
k 
= PD 
2 FCP GOF 


dim F=k dim G=j 


k 
=D! DE fe(P/9)Lo(-4) 


j=0 GOP 
dim G=j 
k 
= aig d—Jj = 
tS p> (7-4) tel 
dim G=j 
k 
= i(4-J 
= (VA) BO 


Here f,(P/G) denotes the number of k-faces of P containing a given j-face G 
of P. Since P is simple, this number equals (7-7) (see Exercise 5.5). 
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5.3 Applications to the Coefficients of an Ehrhart 
Polynomial 


We will now apply Theorem 5.3 to the computation of the Ehrhart polynomial 
of an integral d-polytope P. The only face-lattice-point enumerator involving 
the face P is Fa(t), for which Theorem 5.3 specializes to 


d 
Lp(t) = Fat) = 5 0(-1) 5(-2). 
j=0 


In fact, we do not have to assume that P is simple, since this identity simply 
counts integer points by faces. (Recall that (—1)/F;(—t) counts the integer 
points in the ¢-dilates of the interior of the j-faces.)° The last term on the 
right-hand side is 


(—1)?Fa(—t) = (-1)*Lp(-t) = Lpo(t) 


by Ehrhart—Macdonald reciprocity. Shifting this term to the left gives 
Lp(t) — Lpo(t) = So(-1)'Fj(-2). (5.3) 


The difference on the left-hand side of this identity has a natural interpretation: 

it counts the integer points on the boundary of tP. (And in fact, the right-hand 

side is once more an inclusion—exclusion formula for this number.) Let us 

write Lp(t) = cgt? + cg_1 2-1 + +++ +9. Then Lpo(t) = egt? — cg_yt?-1 + 
- + (—1)4co, so that 


Lp(t) — Lpo(t) = 2cg-1t* | + 2cg_3t 2 +---, 


where this sum ends with 2cp if d is odd and 2c;t if d is even (this should 
look familiar; see Exercise 4.11). Combining this expression with (5.3) yields 
the following useful result. 


Theorem 5.4. Suppose Lp(t) = cat? + ca_1t41 +--++ 9 is the Ehrhart 
polynomial of P. Then 


1 : 
ca-1t*-1 + cq-gt? 3 4-66 = 5 DEH), 


We can make the statement of this theorem more precise (but also messier) 
by writing 


5 So one might argue that we did not need the Dehn-Sommerville machinery for the 
computations in the current section. This argument is correct, although Theorem 5.3 is a 
strong motivation. 
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F;(t) = » Lr(t) = Chg t + 65-1871 +--+ +50. 
FOP 
dim F=j 


Then collecting the coefficients of t* in Theorem 5.4 yields the following 
relations. 


Corollary 5.5. If k and d are of different parities, then 


d—1 


1 oe 
eee) So(-1) Fey b 


j=0 


If k and d have the same parity, then the left-hand side has to be replaced 
by 0. 

The first coefficient c, in the Ehrhart polynomial of a d-polytope P satisfy- 
ing the parity condition is cg_;. In this case, Corollary 5.5 tells us that cq_1 
equals s times the sum of the leading coefficients of the Ehrhart polynomials 
of the facets of P. 

The next interesting coefficient is cqg_3. For example, if dimP = 4, we can 
use Corollary 5.5 to compute c; entirely from (the linear coefficients of) the 
Ehrhart polynomials of the faces of dimension < 3. 


Fig. 5.2 The line segment from (0,0) to (4,2) and its affine sublattice. 


5.4 Relative Volume 


It is time to return to continuous volume. Recall Lemma 3.19: if S Cc R¢@ 
is d-dimensional, then vol S = lim; +5. 4 ff (tS a) Z*), Back in Chapter 3, 
we stressed the importance of S being d-dimensional, because otherwise 
(i.e., S is lower-dimensional although living in d-space), by our definition, 
vol, = 0. However, the case that S$ C R? is not of dimension d is often very 
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interesting; an example is the polytope P that we encountered in connection 
with the coin-exchange problem in Chapter 1. We still would like to compute 
the volume of such objects, in the relative sense. This makes for a slight 
complication. Let us say S C R¢@ is of dimension m < d, and let span S = 
{x + XA(y — x): x,y € S, X € R}, the affine span of S. If we follow the same 
procedure as above (counting boxes or grid points), we compute the volume 
relative to the sublattice (span 9) M Z4; we call this the relative volume 
of S. 

For example, the line segment L from (0,0) to (4,2) in R? has the relative 
volume 2, because in span = {(z,y) € R?: y= $2}, L is covered by two 
segments of “unit length” in this affine subspace, as pictured in Figure 5.2. A 
3-dimensional instance that should be reminiscent of Chapter 1 is illustrated 
in Figure 5.3. 


Fig. 5.3 The triangle defined by 7 + oo +5 =1,2>0,y > 0,z > 0. The shaded region 
is a fundamental domain for the sublattice that lies on the affine span of the triangle. 


If S C R?@ has full dimension d, the relative volume coincides with the 
“full-dimensional” volume. Henceforth, when we write vol S' we refer to the 
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relative volume of S. With this convention, we can rewrite Lemma 3.19 to 
accommodate a set S C R@ that is m-dimensional: its relative volume can be 
computed as 


ms, d 
vol S = jim = # (tS NZ de 
In the case that # (tS a) Z*) has the special form of a polynomial—for 
example, if S is an integral polytope—we can further simplify this theorem. 
Suppose P C R?¢ is an integral m-polytope with Ehrhart polynomial 
Lp(t) = ont” toma t™ +e tett1. 


Then according to the above discussion, and much in syne with Lemma 3.19, 


Cris 


1 t™ _yt™-t4... t+1 
vol P = lim —Lp(t) = lim cmt em=1 Tees 


too t t-00 ¢™ 


The relative volume of P is the leading term of the corresponding counting 
function Lp. 

For example, in the previous section we found that Corollary 5.5 implies 
that the second leading coefficient cg_; of the Ehrhart polynomial of the 
d-polytope P equals 4 times the sum of the leading terms of the Ehrhart 
polynomials of the facets of P. The leading term for one facet is simply the 


relative volume of that facet: 


Theorem 5.6. Suppose Lp(t) = cat? + cg_1t¢-1 +--++ 9 is the Ehrhart 
polynomial of the integral polytope P. Then 


1 
Ga = 5 >» vol F . 
F a facet of P 


Notes 


1. The Dehn-Sommerville relations (Theorem 5.1) first surfaced in the work 
of Max Dehn, who proved them in 1905 for dimension 5 [100]. (The Dehn— 
Sommerville relations are not that complicated for d < 4; see Exercise 5.4.) 
Some two decades later, D. M. Y. Sommerville proved the general case [223]. 
Theorem 5.1 was neither well known nor much used in the first half of 
the twentieth century, achieving renown only after its rediscovery by Victor 
Klee [150] and its appearance in Branko Griinbaum’s famous and widely 
read book [126]. The Dehn-Sommerville relations will play a prominent role 
in Chapter 10. 


2. The Euler relation (Theorem 5.2) is easy to prove directly for d = 3 (this 
case is attributed to Euler), but for higher dimension, one has to be somewhat 
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careful, as we already remarked in the text. The classical proof for general 
d was found in 1852 by Ludwig Schlafli [211], although it (like numerous 
later proofs) assumes that the boundary of a convex polytope can be built up 
inductively in a “good” way. This nontrivial fact—which is called shelling of a 
polytope—was proved by Heinz Bruggesser and Peter Mani in 1971 [71]. (One 
instance of a shelling will be the subject of Exercise 10.16.) Shellability is 
nicely discussed in [258, Lecture 8]. There are short proofs of the Euler relation 
that do not use the shelling of a polytope (see, for example, [162, 184, 248]). 


3. The reader might suspect that proving Theorems 5.1 and 5.2 for rational 
polytopes suffices for the general case, since it seems that we can transform a 
polytope with irrational vertices slightly to one with only rational vertices 
without changing the face structure of the polytope. This is true in our 
everyday world, but it fails in dimension > 4 (see [206] for dimension 4 
and [258, pp. 172-173] for general dimension). 


4. Theorem 5.3 is due to Peter McMullen [171], who, in fact, proved this 
result in somewhat greater generality. Another generalization of Theorem 5.3 
can be found in [84]. 


Exercises 


5.1. Consider a simple 3-polytope with at least five facets. Two players play 
the following game: each player, in turn, signs his or her name on a previously 
unsigned face. The winner is the player who first succeeds in signing three 
facets that share a common vertex. Show that the player who signs first will 
always win by playing as well as possible.® 


5.2. Show that for the d-cube, fy = 2¢-*(?). 
5.3. & Given a polytope P, prove that 
P=) 
FCP 
as a disjoint union over all faces of P, and deduce from this (5.1). 


5.4. Give an elementary proof of the Dehn-Sommerville relations (Theo- 
rem 5.1) for d < 4. 


5.5. de Let P be a simple d-polytope. Prove that the number of k-faces of P 
containing a given j-face of P equals (777). 


5.6. & Show directly, without using Theorem 5.2, that for a d-simplex, 


6 This was one of the 2002 Putnam contest problems. 
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(a) fe = (c45)3 


5.7. & Prove Theorem 5.1 directly (and hence not requiring P to be an 
integral polytope). (Hint: Orient yourself along the proof of Theorem 5.3, 
but start with the Euler relation (Theorem 5.2) for a given face F instead 
of (5.2).) 


5.8. Let F be a face of a simple polytope P. Prove that for 0 < k < d, 


Saye (AF) = aye (Sn): 


GOOF 


5.9. & The dual polytope of P (often also called the polar polytope) is 
defined as 
P* = {xeR?*: x-y <lforallyeP}. 


(For P* to be a polytope, we need to require that the origin be in the interior 
of P; see also the notes at the end of Chapter 4.) 


(a) Prove that the face lattice of P* equals the face lattice of P turned upside 
down. 

(b) Show that P* is simple if and only if P is simplicial, that is, all nontrivial 
faces of P are simplices. 

(c) Prove the Dehn—Sommerville relations for a simplicial d-polytope: for 


O0<k<d, 
d-1 a4 
= —1)¢-j-1 . 
f= yaa te 
j=k 
This identity also holds for k = —1 if we define f_,; := 1. (Hint: Try a 
proof that is dual to the one hinted at in Exercise 5.7.) 


5.10. Show that the equations in Theorem 5.3 are equivalent to the following 
identities. If P is a simple lattice d-polytope and k < d, then 


Deve 6 _ : Fu_;(-t) = S0(-1)* @ F(t). 


TJ 


5.11. Prove that the equations in the previous exercise imply the following 
identities, which compare the number of lattice points in faces and relative 
interiors of faces of the simple polytope P: 


k fi d ‘ 

Sew(e4) yo eenzy= (1) O #@nzy, 
J FCP i=k GCP 

dim F=d—j dim G=i 


Exercises 111 
where k = 0,...,d = dim/?. For example, for k = 0, 


#(PNZ) = S° #(G°NZ4), 


GCP 
and for k = d, we obtain the inclusion—exclusion formula 


d 
#(P°NZ) =S\(-)*4 SO # (FZ). 
j=0 FCP 
dim F=j 
5.12. Another nice reformulation of Theorem 5.3 is the following generalized 
reciprocity law. For a simple integral d-polytope P, define the generalized 
Ehrhart polynomial 


for 0 << k <d. Prove the following generalized reciprocity law: for 0 < k < d, 
Ex(—t) = (-1)* Ea-x(t), 


which implies Ehrhart—-Macdonald reciprocity (Theorem 4.1) for simple inte- 
gral polytopes when k = 0. 


5.13. What happens when P is not simple? Give an example for which 
Theorem 5.3 fails. 


5.14. This exercise shows that sometimes, we may use the pigeonhole principle 
in a surprising way to find geometric structure. 


(a) Given any collection of five points on the unit sphere in R?, show that 
there is a closed half-space containing at least four of them. 

(b) Suppose we are given a collection of five vectors in R° such that no two 
of those vectors lie in a hyperplane. Show that we can always find four of 
them that form the edges of a 3-dimensional pointed cone. 


5.15. Let P C R® be a 3-dimensional integral polytope. Show that the sum 
of the relative areas of the facets of P equals the number of integer points on 
the boundary of P minus 2. 


5.16. Give an alternative proof of Theorem 5.6 by considering Lp (t) — Lpe(t) 
as the lattice-point enumerator of the boundary of P. 


Chapter 6 
Magic Squares 


The peculiar interest of magic squares and all lusus numerorum in general lies in the fact 
that they possess the charm of mystery. 


W. S. Andrews 


Fig. 6.1 Magic square at 
the Temple de la Sagrada 
Familia (Barcelona, Spain). 


Equipped with a solid base of theoretical results, we are now ready to return 
to computations. We use Ehrhart theory to assist us in enumerating magic 
squares. 

Loosely speaking, a magic square is an n x n array of integers (usually 
required to be positive, often restricted to the numbers 1,2,...,n?, usually 
required to have distinct entries) whose sum along every row, column, and 
main diagonal is the same number, called the magic sum. Magic squares 
have turned up time and again, some in mathematical contexts, others in 
philosophical or religious contexts. According to legend, the first magic square 
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(the ancient Luo Shu square) was discovered in China sometime before the 
first century B.C.E. on the back of a turtle emerging from a river. It looked 
like this: 


4} 9} 2 
3) 5 | 7 
8}; 1] 6 


Our task in this chapter is to develop a theory for counting certain classes 
of magic squares, which we now introduce. 


6.1 It’s a Kind of Magic 


One should notice that the Luo Shu square has the distinct entries 1,2,...,9, 
so these entries are distinct positive integers drawn from a particular set. Such 
requirements are too restrictive for our purposes. We define a semimagic 
square to be a square matrix whose entries are nonnegative integers and 
whose rows and columns sum to the same number. A magic square is a 
semimagic square whose main diagonals also add up to the line sum. Here is 
one example each of a semimagic and a magic square: 


3} 0] 0 1); 2) 0 
O} 1) 2 O} 1) 2 
Oo} 2] 1 2} 07; 1 


We caution the reader about clashing definitions in the literature. For 
example, some people reserve the term magic square for what we will call 
a traditional magic square, a magic square of order n whose entries are 
the distinct integers 1,2,...,n?. (The Luo Shu square is an example of a 
traditional magic square.) Others are slightly less restrictive and use the term 
magic square for a magic square with distinct entries. We stress that we do 
not make this requirement in this chapter. 

Our goal is to count semimagic and magic squares. In the traditional case, 
this is in some sense not very interesting:! for each order, there is a fixed 
number of traditional magic squares. For example, there are 7040 traditional 
4 x 4 magic squares. 

The situation becomes more interesting if we drop the condition of tradi- 
tionality and study the number of magic squares as a function of the line sum. 


1 It is, nevertheless, an incredibly hard problem to count all traditional magic squares of a 
given size n. At present, these numbers are known only for n < 5 [1, Sequence A006052]. 
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We denote the total numbers of semimagic and magic squares of order n and 
line sum t by H,,(t) and M,,(t), respectively. 


9 |t-9 : : 
2 2 

1-9 | 9 : x 
2 2 


Fig. 6.2 Semimagic and magic squares for n = 2. 


Example 6.1. We illustrate these notions for the case n = 2, which is not 
very complicated. Here a semimagic square is determined once we know one 
entry, say the upper left one, denoted by Y in Figure 6.2. Because of the 
upper row sum, the upper right entry has to be t — Y, as does the lower left 
entry (because of the left column sum). But then the lower right entry has to 
be t — (t — Y) = O (for two reasons: the lower row sum and the right column 
sum). The entry Y can be any integer between 0 and t. Since there are t+ 1 
such integers, 

Ao(t)=t4+1. (6.1) 


In the magic case, we have also to think of the diagonals. Looking back at our 
semimagic square in Figure 6.2, we see that the first diagonal gives 2.0 = t, 
or Q= f In this case, t — 0 = ‘, and so a 2 x 2 magic square has to have 
identical entries in each position. Because we require the entries to be integers, 
this is possible only if t is even, in which case we obtain precisely 1 solution, 
the square on the right in Figure 6.2. That is, 


1 if t is even, 
Ma ' if t is odd. 


These easy results already hint at something: the counting function H,, is of 
a different character from that of the function M,,. 


6.2 Semimagic Squares: Integer Points in the 
Birkhoff—von Neumann Polytope 


Just as the Frobenius problem was intrinsically connected to questions about 
integer points on line segments, triangles, and higher-dimensional simplices, 
magic squares and their relatives have a life in the world of geometry. The 
most famous example is connected to semimagic squares. 
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A semimagic n x n square has n? nonnegative entries that sum to the same 
number along every row and along every column. Consider, therefore, the 
polytope 


; ; , yo, Uj, =1 forall <k<n 
Paes : ; = DUS 5 Sa toa 1 gen ; 
Eni -.-. Inn 
(6.2) 

consisting of nonnegative real matrices, in which all rows and columns sum to 
1. The polytope B,, is called the n*® Birkhoff-von Neumann polytope, 
in honor of Garrett Birkhoff (1911-1996)? and John von Neumann (1903- 
1957).° Because the matrices contained in the Birkhoff-von Neumann polytope 
appear frequently in probability and statistics (the line sum 1 representing 
probability 1), B, is often described as the set of all n x n doubly stochastic 
matrices. 

Geometrically, 6, is a subset of IR” and as such is difficult to picture once 
n exceeds 1.4 However, we can get a glimpse of By C R* when we think about 
what form points in Bz can possibly attain. Very much in sync with Figure 6.2, 
such a point is determined by its upper left entry Y and looks like this: 


9 1-9 
1-9 Q9 ; 
The entry 9 is a real number between 0 and 1, which suggests that Bz should 


look like a line segment in 4-space. Indeed, the vertices of Bz should be given 
by 9 = 0 and Y = 1, that is, by the points 


10 01 
fail (95) €B: 


These results generalize: B,, is an (n — 1)?-polytope (see Exercise 6.3) whose 
vertices (Exercise 6.5) are the permutation matrices, namely, those n x n 
matrices that have precisely one 1 in each row and column (every other entry 
being zero). For dimensional reasons, we can talk about the continuous volume 
of B,, only in the relative sense, following the definition of Section 5.4. 

The connection of the semimagic counting function H,,(t) to the Birkhoff- 
von Neumann polytope 8, becomes clear in the light of the lattice-point 
enumerator for B,: the counting function H,,(¢) enumerates precisely the 
integer points in tB,, that is, 


2 For more information about Birkhoff, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Birkhoff Garrett.html. 


3 For more information about von Neumann, see 
http: //www-history.mcs.st-andrews.ac.uk/Biographies/Von_Neumann.html. 


4 And the case n = 1 is not terribly interesting: B, = {1} is a point. 
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2 
H(t) = # (tBn NZ") = Le, (t). 


We can say more after noticing that permutation matrices are integer points 
in 6,, and so Ehrhart’s theorem (Theorem 3.8) applies: 


Theorem 6.2. H,,(t) is a polynomial in t of degree (n—1)?. 


The fact that H, is a polynomial—apart from being mathematically 
appealing—has the same nice computational consequence that we exploited 
in Section 3.7: we can calculate this counting function by interpolation. For 
example, to compute Hy, a linear polynomial, we need to know only two 
values. In fact, since we know that the constant term of Hz is 1 (by Corol- 
lary 3.15), we need only one value. It is not hard to convince even a lay person 
that H2(1) = 2 (which two semimagic squares are those?), from which we 
interpolate 

Ao(t)=t4+1. 


To interpolate the polynomial H3, we need to know four values aside from 
H3(0) = 1. In fact, we do not even have to know that many values, because 
Ehrhart—Macdonald reciprocity (Theorem 4.1) assists us in computations. To 
see this, let H°(t) denote the number of n x n square matrices with positive 
integer entries summing up to ¢ along each row and column. A moment’s 
thought (Exercise 6.6) reveals that 


H(t) = H,(t—n). (6.3) 


But there is a second relationship between H,, and H°, namely, H?(t) counts, 
by definition, the integer points in the relative interior of the Birkhoff—-von Neu- 
mann polytope B,,, that is, H}(t) = Lge (t). Ehrhart-Macdonald reciprocity 
(Theorem 4.1) now gives 


He(-t) = (-1)°-Y" A, (2). 


Combining this identity with (6.3) gives us a symmetry identity for the 
counting function for semimagic squares: 


Theorem 6.3. The polynomial H,, satisfies 


and 


H,(-1) = A,(—2) =:-- = H,(-n4+ 1) =0. 
The roots of H,, at the first n—1 negative integers follow from (Exercise 6.7) 
Ay (1) = A (2) =--- = He(n—-1) =0. 


Theorem 6.3 gives the degree of 6,,, and it implies that the numerator of the 
Ehrhart series of the Birkhoff-von Neumann polytope is palindromic: 
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Corollary 6.4. The Ehrhart series of the Birkhoff-von Neumann polytope 
B,, has the form 


h* 


(n-1)(n—2) 4... * 
-1)(n—2) 2 +:--+h 
Fie (2) = (n—1)(n—2) 0 


(1 = Pile le 


? 


where ho, hj, .. i -1)(n—2) € Z>0 satisfy hy = Rin 1)(n—2)—k forO<k< 
neh) 


Proof. Denote the Ehrhart series of B,, by 


Rin—12 Zo pe AAG 
(1 _ zr ret 


Ehrg,, (z) = 


The fact that hf, j)2 = = Rin—1)2— a 0 follows from the second 
part of Theorem 6.3 and Theorem 4.5. The palindromicity of the numerator 
coefficients follows from the first part of Theorem 6.3 and Exercise 4.8: it 
implies 


Ehrz, (<) = (—1)@-)? +120 Ehrg, (z) 
z 


which yields hy = Rin—1)(n—2)— , on simplifying both sides of the equation. 


Let’s return to the interpolation of H3: Theorem 6.3 gives, in addition to 
H3(0) = 1, the values 


H3(-3)=1 and  H3(-1) = H3(—2) =0. 


These four values together with H3(1) = 6 (see Exercise 6.1) suffice to 
interpolate the quartic polynomial H3, and one computes 


1 1 
42194 2a ee, (6.4) 


HG\= 
a(t) 8 4 8 4 


This interpolation example suggests the use of a computer; we let it calculate 
enough values of H,, and then simply interpolate. As far as computations 
are concerned, however, we should not get too excited about the fact that 
we computed Hy» and H3 so effortlessly. In general, the polynomial H,, has 
degree (n — 1)”, so we need to compute (n — 1)? + 1 values of H,, to be 
able to interpolate. Of those, we know n (the constant term and the roots 
given by Theorem 6.3), so n? — 3n + 2 values of H, remain to be computed. 
Ehrhart—Macdonald reciprocity reduces the number of values to be computed 
to ee, That is still a large number, as anyone can testify who has tried 
to get a computer to enumerate all semimagic 7 x 7 squares with line sum 
15. Nevertheless, it is a fun fact that we can compute H,, for small n by 
interpolation. It is amusing to test one’s computer against the constant-term 
computation we will outline below, and we invite the reader to try both. 
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For small n, interpolation is clearly superior to a constant-term computation 
in the spirit of Chapter 1. The turning point is somewhere around n = 5: 
the computer needs more and more time to compute the values H,,(t) as t 
increases. Methods superior to interpolation are needed. 


6.3 Magic Generating Functions and Constant-Term 
Identities 


Now we will construct a generating function for H,, for which we will use 
Theorem 2.13. The semimagic counting function H,, is the Ehrhart polynomial 
of the n* Birkhoff-von Neumann polytope B,, which, in turn, is defined as a 
set of matrices by (6.2). First we rewrite the definition of B,, to fit the general 
description (2.23) of a polytope. If we consider the points in 6, as column 
vectors in R™ (rather than as matrices in R"*"), then 


Br = {x € RS : Ax=b}, 


where 


(here we do not show the zero entries) and 
1 
1 
b=|.]|¢€Z™. 
1 
From this description of B,, we can easily build the generating function 


for H,. According to Theorem 2.13, for a general rational polytope P = 
{x € R¢, : Ax=b}, 


1 
Lp(t) = const (q —ge)(1—2%)---(1 ze} ’ 


where €1,C2,...,Cq denote the columns of A. In our special case, the columns 
of A are of a simple form: they contain exactly two 1’s and elsewhere 0’s. We 
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need one generating-function variable for each row of A. To keep things as clear 
as possible, we use 21, 22,.--, Zn for the first n rows of A (representing the row 
constraints of 6,,) and w,,W2,...,W, for the last n rows of A (representing 
the column constraints of 6,,). With this notation, Theorem 2.13 applied to 
B,, gives the following starting point for our computations: 


Theorem 6.5. The number H,(t) of semimagic n x n squares with line sum 
t satisfies 


1 


t 
Th<jren (1 — zjwx) Giger 25 Hi<ken wr) 


H(t) = const 


This identity is of both theoretical and practical use. One can use it to compute 
Hf3 and even Hy by hand. For now, we work on refining it further, exemplified 
by the case n = 2. 

We first note that in the formula for H2, the variables w; and w2 are 
separated, in the sense that we can write this formula as a product of two 
factors, one involving only w, and the other involving only we: 


Halt) = const (= : : Ee 


zizé (1— 2w1) (1 — zew1) wt (1 — z1we) (1 — zewe) wh 


Now we put an ordering on the constant-term computation: let’s first compute 
the constant term with respect to we, then the one with respect to w,. (We 
do not order the computation with respect to z, and z2 just yet.) Since 21, 
z2, and w, are considered constants when we do constant-term computations 
with respect to w2, we can simplify: 


1 
1— zQu1) wt 


x const, € zw.) a — 2202) a))) . 


Now we can see the effect of the separate appearance of w, and we: the 
constant-term identity factors. This is very similar to the factoring that can 
appear in computations of integrals in several variables. Let’s rewrite our 
identity to emphasize the factoring: 


1 
H(t) = const,, + (4 const y, (z Zama 


1 1 
H(t) = const ——; const 
(0) = conta (ep conten (Gti ey) 


di: 
tow , 
X CONStyw, (=> isant)) 


But now the expressions in the last two sets of parentheses are identical, 
except that in one case, the constant-term variable is called w;, and in the 
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other case, w2. Since these are just “dummy” variables, we can call them w, 
and combine: 


a(t) t — t : ; 
= 21,22 ONSty ‘ 
Cs Senne zi zk (1 — zw) (1 — zqw) wt 


(Note the square!) Naturally, all of this factoring works in the general case, 
and we invite the reader to prove it (Exercise 6.8): 


1 n 
Hy (i) = const, “++ Zn)" { const, 
(6.6) 
We can go further, namely, we can compute the innermost constant term 


1 
(1 — zw)-+- (1 — z,w)w 


const, a 


It should come as no surprise that we shall use a partial fraction expansion to 
1 1 


do so. The w-poles of the rational function are at w = ZU =f B= 
oo w = 0, and so we expand 
1 A A A i Be 
~ Sy arte “7 oe z+ (6.7) 
(1— z,w)--- (1 — 2z,w)w wc We Gh we 


Just as in Chapter 1, we can forget about the B;-terms, since they do not 
contribute to the constant term, that is, 


1 
t 
pee (1 — z1w)--- (1 — 2nw)wt 
A A A 
= const, 1 } a +---4 “| 
= AZ Agz See, AnZn - 


We invite the reader to show (Exercise 6.9) that 


A= oe 
(tan (aS) = ona s) 


Putting these coefficients back into the partial fraction expansion yields the 
following identity. 
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Theorem 6.6. The number H,(t) of semimagic n x n squares with line sum 


t satisfies 
n gtd te 
AT, (t) = con Zy+++ Bn) —______ . 


Amidst all this generality, we almost forgot to compute H2 with our partial 
fraction approach. The last theorem says that 


P geri Rae 2 
H(t) = const { (z1z2)~ ( + 2 ) 
21 — 22 22 — 21 


= const ( a a = ce ) . (6.9) 
(2i— 22)? (21-2)? (21 — 22)? 


It is now time to put more order on the constant-term computation. Let’s say 
that we first compute the constant term with respect to z,, and after that 
with respect to z2. So we have to compute first 


t+2,-t —t t+2 
Zz 212 Zz, 2 
const z, as , const z, me , and const,, eau = | 2 
(21 — 22)? (21 — 22)? (z1 — 22)? 
To obtain these constant terms, we need to expand the function aa As 


we know from calculus, this expansion depends on the order of the magnitudes 
of z, and zg. For example, if |z1| < |z2|, then 


and hence 


1 d i bpd k+l, 
2~ dz (—)-> FHI ~1 os k+2 al 


(21 — 22) > 72 ko 72 


So let’s assume for the moment that |z1| < |z2|. This might sound funny, since 
z, and zg are variables. However, as such they are simply tools that enable 
us to compute some quantity that is independent of z, and zg. In view of 
these ideas, we may assume any order of the magnitudes of the variables. In 
Exercise 6.11, we will check that indeed the order does not matter. Now, 


bat), re 
const, (4 — 2)? = 29 const z, (1 — 2)? 


= k+1, 
= 2, const,, | 27? a ae: ei (6.10) 


k>0 ~2 
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k+1 
_ o-t k+t+2 
= 2) const,, y kaa 71 


k>0 *2 


=0, 


since there are only positive powers of z, (recall that t > 0). Analogously (see 
Exercise 6.10), one checks that 


VARS 
const, (Ms) =U. (6.11) 


For the last constant term, we compute 


—tt+2 
const,, ( 1 “2 _ ) = 2+? const, [ 27% s ae zk 
zy (z, — z2)? aie” 21 1 k+2 ~1 
k>0 “2 


k+1 
t+2 k-t 
= 2," const., y kqz 71 


k>0 *2 


The constant term on the right-hand side is the term with k = t, that is, 


-t ,t+2 
zy % t+1 
const, (225) i aS) =t+l. 
2 


So of the three constant terms, only one survives, and with const, (¢+1) = t+1 
we recover what we have known since the beginning of this chapter: 


2 <4 8 
Ha(t) = const (2 *2 eee ee :) =t+1. 


(21 — 22)? (z1 — 22) 


This was a lot of work for this seemingly trivial polynomial. Recall, for 
example, that we can get the same result by an easy interpolation. However, 
to compute a similar interpolation, e.g., for H4, we likely would need to 
use a computer (to obtain the interpolation values). On the other hand, the 
constant-term computation of H4 boils down to only five iterated constant 
terms, which can actually be computed by hand (see Exercise 6.14). The 
result is 


i 11 19 2 1109 43 35117 
H,(t) = ‘4 a 7 t8 4 eal 3 
a(t) 11340 630° isp 8 an 10 5670 
379 65 


ee ep 
o Ga ge 
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6.4 The Enumeration of Magic Squares 


What happens when we bring the diagonal constraints, which are not present 
in the semimagic case, into the magic picture? In the introduction of this 
chapter we have already seen an example, namely the number of 2 x 2 magic 
squares, 

1 if t is even, 


MA fe if t is odd. 


This is a very simple example of a quasipolynomial. In fact, like H,, the 
counting function M,, is defined by integral linear equations and inequalities, 
so it is the lattice-point enumerator of a rational polytope, and Theorem 3.23 
gives at once the following result. 


Theorem 6.7. The counting function M,(t) is a quasipolynomial in t. 


We invite the reader to prove that the degree of M,, is n? — 2n — 1 (Exer- 


cise 6.16). 
Let’s see what happens in the first nontrivial case, 3 x 3 magic squares. 
We follow our recipe and assign variables m,,mg2,...,7™g to the entries of our 


3 x 3 squares: 


my, | m2} ™3 


M4 | 775 | ME 


M7 | Mg | M9 


The magic conditions require now that m1, mz,...,mg € Z>o and 


my, +m2+ms3=t, 


M+ M5 + M6 =F, 
m7 + Mg + mg =F, 
my +m4+m7,=t, 
m2 +™M5 + Mg =F, 
mz + Mg + Mg =F, 


M1 +Ms5 + Mg =F, 


m3 +ms5 +m7=t, 


according to the row sums (the first three equations), the column sums (the 
next three equations), and the diagonal sums (the last two equations). By 
now, we are experienced in translating this system into a generating function: 
we need one variable for each equation, so let’s take 21, 22, 23 for the first 
three, w1, W2, w3 for the next three, and y1, y2 for the last two equations. The 
function M3(t) is thus the constant term of 
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1 


(1 = z1W1y1) (1 = 21 We) (1 = 21 W3Y2) (1 — 221) (1 = 22W2Y1Y2) (1 = 22W3) 


1 
x a (6.12) 
(1 — zgwiy2) (1 — z3we) (1 — zgw3y1) (212223W1W2W3y1Y2) 


It does take some work, but it is instructive to compute this constant term 
(just try it!). The result is 


M3(t) = 


24721 2 if 3l¢ 
oe if 3|t, es 


otherwise. 


As predicted by Theorem 6.7, M3 is a quasipolynomial. It has degree 2 and 
period 3. This may be more apparent if we rewrite it as 


2 4+2t+1 ift=0 mod 3, 
M3(t) = «0 if ¢= 1 mod 3, 
0 if t= 2 mod 3, 


and we can see the three constituents of the quasipolynomial M3. There is 
an alternative way to describe M3; namely, let 


2 if t= 0 mod 3, 
ce(t)= 40 ift=1mod3, 
0 ift=2mod3, 
2 if t= 0 mod 3, 
a(t)= 40 ift=1mod3, 
0 ift=2 mod 3, 
1 ift=0 mod 3, 
co(t)= 40 ift=1 mod 3, 
O ift =2 mod 3. 


Then the quasipolynomial M3 can be written as 


M3(t) = co(t) t? + c1(t) t+ co(t). 


Notes 


1. Magic squares date back to China in the first millennium B.C.E. [77]; they 
underwent much further development in the Islamic world late in the first 
millennium C.E. and in the next millennium (or sooner; the data are lacking) 
in India [78]. From Islam, they passed to Christian Europe in the later Middle 
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Ages, probably initially through the Jewish community [78, Part II, pp. 290 
ff.] and later possibly Byzantium [78, Part I, p. 198], and no later than the 
early eighteenth century (the data are buried in barely tapped archives) to 
sub-Saharan Africa [256, Chapter 12]. The contents of a magic square have 
varied with time and writer; usually they have been the first n? consecutive 
integers, but often any arithmetic sequence or arbitrary positive numbers. 
In the past century, mathematicians have made some simplifications in the 
interest of obtaining results about the number of squares with a fixed magic 
sum, in particular, allowing repeated entries as in this chapter. 


2. The problem of counting magic squares (other than traditional magic 
squares) seems not to have occurred to anyone before the twentieth century, no 
doubt because there was no way to approach the question previously. The first 
nontrivial formulas addressing the counting problem, namely (6.4) and (6.13) 
for H3 and M3, were established by Percy Macmahon in 1915 [167]. Recently, 
there has grown up a literature on exact formulas (see, for example, [110,224] 
for semimagic squares; for magic squares, see [2,35]; for magic squares with 
distinct entries, see [47, 48, 254]). 


3. Another famous kind of square is a Latin square (see, for example, [101]). 
Here each row and column has n different numbers, the same n numbers in 
every row/column (usually taken to be the first n positive integers). There 
are counting problems associated with Latin squares, which can be attacked 
using Ehrhart theory [48] (see also [1, Sequence A002860}). 


A. Recent work includes mathematical-historical research, such as the discovery 
of unpublished magic squares of Benjamin Franklin [3, 187]. Aside from 
mathematical research, magic squares and their siblings naturally continue 
to be an excellent source of topics for popular mathematics books (see, for 
example, [7,191]). 


5. The Birkhoff-von Neumann polytope B,, possesses fascinating combinatorial 
properties [56,69,70,83,257] and relates to many mathematical areas [104,151]. 
Its name honors Garrett Birkhoff and John von Neumann, who proved that the 
extremal points of B,, are the permutation matrices [57,251] (see Exercise 6.5). 
A long-standing open problem is the determination of the relative volume of 
B,,, which is known only for n < 10 [1, Sequence A037302]. In fact, the last 
two records (n = 9 and 10) for computing vol 6,, rely on the theory of counting 
functions that is introduced in this book, more precisely, Theorem 6.6 [42]. 


6. An important generalization of the Birkhoff-von Neumann polytope is 
given by transportation polytopes, which consist of contingency tables. They 
have applications to statistics and in particular to disclosure limitation proce- 
dures [97]. The Birkhoff-von Neumann polytopes are special transportation 
polytopes that consist of two-way contingency tables with given 1-marginals. 
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7. The polynomiality of H,, (Theorem 6.2) and its symmetry (Theorem 6.3) 
were conjectured in 1966 by Harsh Anand, Vishwa Dumir, and Hansraj Gupta 
[6] and proved seven years later independently by Eugéne Ehrhart [110] and 
Richard Stanley [224]. See [72,231] for more about the history of Theorems 6.2 
and 6.3 and their connections to commutative algebra. Stanley also conjectured 
that the numerator coefficients in Corollary 6.4 are unimodal, a fact that 
was proved only in 2005, by Christos Athanasiadis [13] (see also [74]). The 
quasipolynomiality of M,, (Theorem 6.7) and its degree are discussed in [35]. 
The period of M,, is in general not known. In [35], it is conjectured that it 
is always nontrivial for n > 1. The work in [2] gives some credence to this 
conjecture by proving that the polytope of magic n x n squares is not integral 
for n > 2. 


8. We close with a story about Cornelius Agrippa’s De Occulta Philosophia, 
written in 1510. In it he describes the spiritual powers of magic squares and 
produces some squares of orders from three up to nine. His work, although 
influential in the mathematical community, enjoyed only brief success, for 
the Counter-Reformation and the witch hunts of the Inquisition began soon 
thereafter: Agrippa himself was accused of being allied with the devil. 


Exercises 


6.1. & Find and prove a formula for H,,(1). 
6.2. Let (ij) <; <3 be a magic 3 x 3 square. 


(a) Show that the center term x22 is the average over all ;;. 
(b) Show that M3(t) = 0 if 3 does not divide t. 


6.3. & Prove that dim B, = (n — 1)?. 


6.4. Prove the following characterization of a vertex of a convex polytope P: 
A point v € P is a vertex of P if and only if for every line L through v and 
every neighborhood N of v, there exists a point in DM N that is not in P. 


6.5. & Prove that the vertices of B, are the n x n permutation matrices. 


6.6. & Let H?(t) denote the number of n x n matrices with positive integer 
entries summing up to t along each row and column. Show that H?(t) = 
H,,(t —n) fort >n. 


6.7. d& Show that H2°(1) = H°(2) =--- = H°(n—1) =0. 
6.8. de Prove (6.6): 
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6.9. & Compute the partial fraction coefficients (6.8). 
6.10. & Verify (6.11). 


6.11. Repeat the constant-term computation of H starting from (6.9), but 
now by first computing the constant term with respect to z2, and after that 
with respect to 21. 


6.12. Use your favorite computer program to calculate the formula for H3(t), 
H4(t),... by interpolation. 


6.13. Compute H3 using Theorem 6.6. 
6.14. Compute H, using Theorem 6.6. 
6.15. Show that 


n gttn-l 


k —_ . M1... Mn 
= 21 Zn > 


k=1 T1546 (2x _ z) mite-+my=t 


and use this identity to give an alternative proof of Theorem 6.6. 
6.16. & Prove that for n > 3, the degree of M,, is n? —2n—1. 


6.17. Explain the period of M3 by computing the vertices of the polytope of 
3 x 3 magic squares. 


6.18. & Verify (6.12) and use it to compute M3. 
6.19. Compute M3 by interpolation. (Hint: Use Exercises 6.2 and 6.17.) 


6.20. A symmetric semimagic square is a semimagic square that is a 
symmetric matrix. Show that the number of symmetric semimagic n x n 
squares with line sum ¢ is a quasipolynomial in t. Determine its degree and 
period. 


Open Problems 


6.21. Compute the number of traditional magic n x n squares for n > 5. 
6.22. Compute vol B,, for n > 10. Compute H,, for n > 9. 
6.23. Prove that the period of M,, is nontrivial for n > 1. 


6.24. The vertices of the Birkhoff-von Neumann polytope are in one-to-one 
correspondence with the elements of the symmetric group S,,. Consider a 
subgroup of $,, and take the convex hull of the corresponding permutation ma- 
trices. Compute the Ehrhart polynomials of this polytope. (The face numbers 
of the polytope corresponding to the subgroup A,,, the even permutations, 
were studied in [141].) 
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6.25. Consider the rational polytope S,, formed by all symmetric n x n doubly 
stochastic matrices (see Exercise 6.20). 


(a) Write Ehrs, (z) as a rational function with denominator (1 — 22)(2) 44, 
Prove that the numerator polynomial is palindromic. 
(b) Prove that the numerator polynomial of Ehrggs, (z) is palindromic. 


(See [92] for partial results.) 


6.26. Prove that the graph formed by the vertices and edges of every 2-way 
transportation polytope is Hamiltonian. 


Part IT 
Beyond the Basics 


Chapter 7 
Finite Fourier Analysis 


God created infinity, and man, unable to understand infinity, created finite sets. 


Gian—Carlo Rota (1932-1999) 


We now consider the vector space of all complex-valued periodic functions on 
the integers with period b. It turns out that every such function a(n) on the 
integers can be written as a polynomial in the b' root of unity €” := e2?7"”/°, 
Such a representation for a(n) is called a finite Fourier series. Here we 
develop finite Fourier theory using rational functions and their partial fraction 
decomposition. We then define the Fourier transform and the convolution of 
finite Fourier series, and show how one can use these ideas to prove identities 
on trigonometric functions, as well as find connections to the classical Dedekind 
sums. 

The more we know about roots of unity and their various sums, the 
deeper are the results that we can prove (see Exercise 7.20); in fact, certain 
statements about sums of roots of unity even imply the Riemann hypothesis! 
However, this chapter is elementary and draws connections to the sawtooth 
functions and Dedekind sums, two basic sums over roots of unity. The general 
philosophy here is that finite sums of rational functions of roots of unity are 
basic ingredients in many mathematical structures. 


7.1 A Motivating Example 


To ease the reader into the general theory, let’s work out the finite Fourier 
series for a simple example first, an arithmetic function with a period of 3. 


Example 7.1. Consider the following arithmetic function, of period 3: 
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n:0,1,2,3,4,5,... 
a(n) :1,5,2,1,5,2,... 


We first embed this sequence into a generating function as follows: 


F(z) = 1452422? + 234524422? +---=S a(n) 2”. 
n>0 


Since the sequence is periodic, we can simplify F(z) using a geometric series 
argument: 


F(z)= y; a(n) 2” 


n>0 
= 14524227 + 28 (14524 227) +29 (14524227) +--- 
(1+ 52 +227) So 234 
k>0 
1+52+ 22? 
1— 23 : 


l| 


We now use the same technique that was employed in Chapter 1, namely the 
technique of expanding a rational function into its partial fraction decompo- 
sition. Here all the poles are simple, and located at the three cube roots of 
unity, so that 


a(o) | _a@(1) a(2) 

F(z)= al 
(2) (oe ape a rt) 
where the constants (0), @(1), @(2) remain to be found, and where p := e?7/8, 
a third root of unity. Using the geometric series for each of these terms 


separately, we arrive at 


F(z) = S> (@(0) + G(1)p” + 4(2)p"") 2”, 
n>0 


so that we have derived the finite Fourier series of our sequence a(n). The 
only remaining piece of information that we need is the computation of the 
constants @(j), for 7 = 0,1,2. It turns out that this is also quite easy to do. 
We have, from (7.1) above, the identity 


a(0) (1 — pz) (1 — p*z) + @(1) (1 — 2) (1 — p*z) + @(2) (1 — 2) (1— pz) 
=14+5z2+4+ 227, 


valid for all z € C. On letting z = 1, p?, and p, respectively, we obtain 
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3a(0) =14+542, 
34(1) =1+4 5p? + 2p%, 
34(2) =1+5p + 2p”, 
where we have used the identity (1 — p)(1 — p?) = 3 (see Exercise 7.2). We 
8 


can simplify a bit to get 4(0) = §, a(1) = 52, and 4(2) = +". Thus 
the finite Fourier series for our sequence is 


The object of the next section is to show that this simple process follows just 
as easily for every periodic function on Z. The ensuing sections contain some 
applications of the finite Fourier series of periodic functions. 


7.2 Finite Fourier Series for Periodic Functions on Z 


The general theory is just as easy conceptually as Example 7.1, and we now 
develop it. Consider a periodic sequence on Z, defined by {a(n)}°,, of period 
b. Throughout the chapter, we fix the b*” root of unity € := e?7’/. As before, 


we embed our periodic sequence {a(n)}°°_) into a generating function, 


n>0 


and use the periodicity of the sequence to immediately get 


b-1 b-1 b—-1 
F(z) = (x a(k) “) + (~ a(k) “| f+ bs a(k) “| ge... 


k=0 k=0 k=0 
b-1 
et a(k) z* _ P(z) 
1— 2° 1— 2’ 
where the last step simply defines the polynomial P(z) = = a(k) z*. Now 


we expand the rational generating function F(z) into partial fractions, as 


before: ia 
re) = eh 


m=0 


1 


As in Example 7.1, we expand each of the terms [remy asa geometric series, 


and substitute into the sum above to get 
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F(z)= a(n) 2” = ~ ae 
n>0 m=0 
b-1 b-1 
Sam ere «Z(Bamen) 
m=0 n>0 n>0 \m=0 


Comparing the coefficients of any fixed z” gives us the finite Fourier series 


for a(n), namely 
b-1 


a(n) = ) 7 a(m) en”. 


m=0 


We now find a formula for the Fourier coefficients G(n), as in the example. 
To recapitulate, 


Pe) = Yo am 
er! 1—€™z 
To solve for P(€~”), we note that 
i= <p a #0 mod b 
in SS _ 
pe ee if m-—n mod b, 
and 
= b b b-1 
lim ——*—= lim “~— =pe7-™=5 if m—n=0modb. 


zen L— Mz zZE-n em 

Thus P(€~") = ba(n), and so 
b-1 

1 

: a( k)e-™. 
k=0 


a(n) = +P = 


We have just proved the main result of finite Fourier series, using only 
elementary properties of rational functions: 


Theorem 7.2 (Finite Fourier series expansion and Fourier inver- 
sion). Let a(n) be any periodic function on Z, with period b. Then we have 
the following finite Fourier series expansion: 


b-1 
a(n) =) a(k)e™", 
k=0 
where the Fourier coefficients are 
yoo 
a(n) = +S ataye*’, (7.2) 
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with € = e27*/>, 


The coefficients @(m) are known as the Fourier coefficients of the function 
a(n), and if @(m) 4 0, we sometimes say that the function has frequency m. 
The finite Fourier series of a periodic function provides us with surprising 
power and insight into its structure. We are able to analyze the function 
using its frequencies (only finitely many), and this window into the frequency 
domain becomes indispensable for computations and simplifications. 

We note that the Fourier coefficients G@(m) and the original sequence elements 
a(n) are related by a linear transformation given by the matrix 


i (e690) (7.3) 


where 1 < i,j < b, as is evident from (7.2) in the proof above. We further note 
that the second half of the proof, namely solving for the Fourier coefficients 
a(n), is just tantamount to inverting this matrix L. 

One of the main building blocks of our lattice-point enumeration formulas 
in polytopes is the sawtooth function, defined by 


_ j{e}-§ ife¢Z, 
(2) = £ if a eZ. 2) 


(As a reminder, {x} = x — || is the fractional part of w.) The graph of this 
function is displayed in Figure 7.1. We have seen a closely related function 
before, in Chapter 1, in our study of the coin-exchange problem. Equation (1.8) 
gave us the finite Fourier series for essentially this function from the discrete- 
geometry perspective of the coin-exchange problem; however, we now compute 
the finite Fourier series for this periodic function directly, pretending that we 
do not know about its other life as a counting function. 


Fig. 7.1 The sawtooth function y = ((z)). 


Lemma 7.3. The finite Fourier series for the discrete sawtooth function (¢)), 
a periodic function of a € Z with period b, is given by 
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- b-l1 
(f)- RE eet = EF ten 


Here the second equality follows from eae = icot(7x), by the definition 


of the cotangent. 


Proof. Using Theorem 7.2, we know that our periodic function has a finite 


Fourier series ((¢)) = = 1 a(k ) €%*, where 
1 7m 
mS a pias mk 
au =1 5 (me 
We first compute 4(0) = } 7”, ((™)) =0, by Exercise 7.15. For k #0 
b-1 b-1 
eS = 1 m 1 —mk _ 1 —mk 1 
ee ae le Pie 
~ b\1L-€& 2) 2b1-—€F’ 


where we used Exercise 7.5 in the penultimate equality above. 


We define the Dedekind sum by 


eG) 


for any two relatively prime integers a and b > 0 (although it may be easily 
extended to all integers—see Exercise 8.11). Note that the Dedekind sum is a 
periodic function of the variable a, with period b, by the periodicity of the 
sawtooth function. That is, 


s(a+ jb,b)=s(a,b) forall f EZ. (7.5) 


Using the finite Fourier series for the sawtooth function, we can now easily 
reformulate the Dedekind sums as a finite sum over the b'” roots of unity or 
cotangents: 


Lemma 7.4. 


1+é4 14+€-" us 
b) = a 
oi) = oe ES - Be 


We will see and use this alternative formulation of the Dedekind sum time 
and again. 
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Proof. 
b-1 
ka k 
vo = 3 ((5)) ((5)) 
k=0 
b-1 b-1 b-1 
_ 1 Lape pka Le vk 
-#y((Ets i 
k=0 \ \y=1 v=1 
b—1 b-1 7 b-1 
ye 1+é"1+€ So ehetua) 
4b 1-—é#1-€& 
pw=lv=1 k=0 
We note that the last sum we, ekv+42) vanishes, unless v = —ya mod b 


(Exercise 7.6), in which case the sum equals b, and we obtain 


s(a, b) 


1+eh1+e-" 
poise ue” 


Rewriting the right-hand side in terms of cotangents gives 


s(a,b) = a S- cot a cot = i pe cot a cot eo 


because the cotangent is an odd function. 


7.3 The Finite Fourier Transform and Its Properties 


Given a periodic function f on Z, we have seen that f possesses a finite 
Fourier series, with the finite collection of Fourier coefficients that we called 
f(0),f( ofl — 1). We now regard f as a function on the finite set 
G = {0,1,2,...,b—1}, and let Ve be the vector space of all complex-valued 
functions on G. Equivalently, Vg is the vector space of all complex-valued 
periodic functions on Z with period b. 

We define the Fourier transform of f, denoted by F(f), to be the periodic 
function on Z defined by the sequence of uniquely determined values 


f(0), f(), ..., f(b-1). 
Thus . 
F(f)(m) = f(m). 


Theorem 7.2 above gave us these coefficients as a linear combination of the 
values f(k), with k = 0,1,2,...,b—1. Thus F(f) is a linear transformation of 
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the function f, thought of as a vector in Vg. In other words, we have shown 
that F(f) is a one-to-one and onto linear transformation of Va. 

The vector space Vg is a vector space of dimension }; indeed, an explicit 
basis can easily be given for Vg using the delta functions (see Exercise 7.7) 
defined by 

f if z= m-+kb, for some integer k, 
bm(X) := . 
0 otherwise. 
In other words, 6;,(a) is the periodic function on Z that picks out the arith- 
metic progression {m+ kb: k € Z}. 

But there is another natural basis for Vg. For a fixed integer a, the roots 
of unity {e,(x) := e?777/° » x € Z} can be thought of as a single function 
e,(z) € Va@ because of its periodicity on Z. As we saw in Theorem 7.2, the 
functions {e)(x),...,e(x)} give a basis for the vector space of functions Vg. 


A natural question now arises: how are the two bases related to each other? 
An initial observation is 


pe 1 : 
da(n) a: goomaa 


which simply follows from the computation 


1, = —kn 1 —an 1 —27rian/b 
k=0 


So to get from the first basis to the second basis, we need precisely the finite 
Fourier transform! 

It is extremely useful to define the following inner product on the vector 
space Vg: 


(fa) = > fe) 9), (7.6) 


for two functions f,g € Vg. Here the bar denotes complex conjugation. The 
following elementary properties show that (f,g) is an inner product (see 
Exercise 7.8): 


1. (f, f) > 0, with equality if and only if f = 0, the zero function. 
2. (f,9) = (9, f). 


Equipped with this inner product, Vg can now be regarded as a metric space: 
We can measure distances between two functions, and in particular between 
two basis elements eg(x) := e?7¢*/> and e,(x) := e?™*/>, Every positive 
definite inner product gives rise to the distance function \/(f — g, f — g). 
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Lemma 7.5 (Orthogonality relations). 


1 ifb|(a—o), 


0 otherwise. 


ole 


(Ca, €c) = bale) = 


Proof. We compute the inner product 


b-1 b-1 
(6i6c) = oa e,(m) e.(m) = 23 e2ti(a—c)m/b_ 
m=0 m=0 


If b| (a—c), then each term equals 1 in the latter sum, and hence the sum 
equals b. This verifies the first case of the lemma. 

If bt (a—c), then eg_-(m) = e27(¢—-9/® is a nontrivial root of unity, and 
we have the finite geometric series 


p21 2Qni(a—c) 

2ni(a—c)m @? ee 
Soe cmH eee =, 
m=0 € B = 


verifying the second case of the lemma. 


Example 7.6. We recall the sawtooth function again, since it is one of the 
building blocks of lattice-point enumeration, and compute its Fourier trans- 
form. Namely, we define 


k Eliot Pt ¢e7, 
B= (1 =)\\= aa ty ¢ ’ 
b 0 if D E Z, 
a periodic function on the integers with period b. What is its finite Fourier 
transform? We have already seen the answer, in the course of the proof of 
Lemma 7.3: 
a Li+g* 4 7m 


() = sen = tS 


for n £0 and B(0) = 0. As always, € = e277/°, 


In the next section, we delve more deeply into the behavior of this inner 
product, where the Parseval identity is proved. 


7.4 The Parseval Identity 


A nontrivial property of the inner product defined above is the following 
identity, linking the “norm of a function” to the “norm of its Fourier transform.” 
It is known as the Parseval identity, and also goes by the name of the Plancherel 
theorem. 
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Theorem 7.7 (Parseval identity). For all f € Va, 
(Ff) =O fy. 


Proof. Using the definition e,,(x) = €™®” and the relation 


m=0 n=0 
b—1 b-1 b-1 
= f(m) @m(k) (n) en(k) 
k=0 m=0 n=0 
b-1 b-1 _ 
= LD fm) FR) (ems en) 
m=0 n=0 


| 
ole 
M 
M 
= 
SS 
SYS 
ey 
S 
3 
= 


where the essential step in the proof was using the orthogonality relations 
(Lemma 7.5) in the fourth equality above. 


A basically identical proof yields the following stronger result, showing 
that the “distance between two functions” is essentially equal to the “distance 
between their Fourier transforms.” 


Theorem 7.8. For all f,g € Va, 


(f,9) =b(f,9). 


Example 7.9. A nice application of the generalized Parseval identity above 
now gives us Lemma 7.4 very quickly, the reformulation of the Dedekind sum 
as a sum over roots of unity. Namely, we first fix an integer a relatively prime 
to b and define f(k) = ((#)) and g(k) = ((4)). Then, using Example 7.6, 


f(n) = # cot 3. To find the Fourier transform of g, we need an extra twist. 


Since 
ka _ 4 9 : ™m emka 
b 7 2b m=1 7 b 


we can multiply each index m by a~!, the multiplicative inverse of a modulo 
b (recall that we require a and b to be relatively prime for the reformulation 
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of the Dedekind sum). Since a~! is relatively prime to b, this multiplication 
just permutes m = 1,2,...,b— 1 modulo b, but the sum remains invariant 
(see Exercise 1.9): 


b-1 b-1 b-1 
TMG TMG 
S> cot ae S- cot ; Eine Tha = ot gmk 
m=1 m=1 m=1 
that is, 
a ( ) u t mat 
n) = — cot ——— 
2 2b b 


Hence Theorem 7.8 immediately gives us the reformulation of the Dedekind 


on ODE) 


a ‘: A =1 
ag u ‘st TIN a aok TING 
A 2b b 2b b 


bi 

= em cop TG 1 
ee cot —c 

Ab b b 

m=1 

= ma 7mm 
= 1 2m 

go” bo 


For the last equality, we again used the trick of replacing m by ma. 


7.5 The Convolution of Finite Fourier Series 


Another basic tool in finite Fourier analysis is the convolution of two finite 
Fourier series. Namely, suppose f and g are periodic functions with period b. 
We define the convolution of f and g by 


(f *g)(t sa (m). 


m=0 


Indeed, it is this convolution tool (the proof of the convolution theorem 
below is almost trivial) that is responsible for the fastest known algorithm for 
multiplying two polynomials of degree b in O(blog(b)) steps (see the notes at 
the end of this chapter). 


viva e a (Convolution paeaies for finite Fourier series). Let 
iie=3 = ane E* and g(t) =; = 6 Ck Ek where € = e27*/>, Then their 
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convolution satisfies 


(f * 9) =F Yome 


Proof. The proof is straightforward: we just compute the left-hand side, and 
obtain 


b-1 y bat /o- b-1 
st —maton)= je Ye (Smet) (Tae) 
m=0 m=0 
y bxbb-1 
= oe (So eneeom) 
k=0 1=0 
-1 
1. kt 
=>) acne”, 
ear 
because the sum | é(—k)™m vanishes, unless | = k (see Exercise 7.6). In 


the case that | = k, we have ee E-k)m — p, 


It is an easy exercise (Exercise 7.23) to show that this convolution theorem 
is equivalent to the following statement: 


F(f *g) =bF(f) F(g)- 


Note that the proof of Theorem 7.10 is essentially identical to the proof of 
Lemma 7.4 above; we could have proved the lemma, in fact, by applying the 
convolution theorem. We now show how Theorem 7.10 can be used to derive 
identities on trigonometric functions. 


Example 7.11. We claim that 


St (BH) OO. 


The sum suggests the use of the convolution theorem, with a function whose 
Fourier coefficients are a, = cy = cot ak . But we already know such a function! 
It is just the sawtooth function 22 (2 n)). Therefore, 


at 


3 Lo" (Fe-¥ (52) G): 


=1 


where the equality follows from Theorem 7.10. On setting t = 0, we obtain 
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b-1 b-1 


1 ft 1 
=e | pom q( 1) 
_ (b= 1)(0—2) 
7 12b : 


as desired. We used the identity ((=“)) = — ((#)) in the first equality above, 
and some algebra was used in the last equality. Notice, moreover, that the 
convolution theorem gave us more than we asked for, namely an identity for 
every value of t. 


Notes 


1. Finite Fourier analysis offers a wealth of applications and is, for example, one 
of the main tools in quantum information theory. For the reader interested in 
going further than the humble beginnings outlined in this chapter, we heartily 
recommend Audrey Terras’s monograph [242]. 


2. The Dedekind sum is our main motivation for studying finite Fourier series, 
and in fact, Chapter 8 is devoted to a detailed investigation of these sums, in 
which the Fourier-Dedekind sums of Chapter 1 also finally reappear. 


3. The reader may consult [155, p. 501] for a proof that two polynomials of 
degree N can be multiplied in O(N log N) steps. The proof of this fact runs 
along the following conceptual lines. First, let the two given polynomials of 
degree N be f(x) =  s a(n) x” and g(x) = 2 b(n) #”. Then we know 
that f(€) and g(€) are two finite Fourier series, and we abbreviate them by 
f and g, respectively. We now note that bfg = F(F~'(f) * F~+(g)). If we 
can compute the finite Fourier transform (and its inverse) quickly, then this 
argument shows that we can multiply polynomials quickly. It is a fact of life 
that we can compute the Fourier transform of a periodic function of period 
N in O(N log N) steps, by an algorithm known as the fast Fourier transform 
(again, see [155] for a complete description). 


4. The continuous Fourier transform, defined by [~. f(t) e~?*"" dt, can be 
related to the finite Fourier transform in the following way. We approximate 
the continuous integral by discretizing a large interval [0,a]. Precisely, we let 


A := §, and we let t, := kA = Fa Then 
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a b 
| f(t) eo ot ite dt x S- f (ts) en onttee A 
u k=1 


which is in essence a finite Fourier series for the function f ($2) as a function 
of « € Z. Hence finite Fourier series find an application to continuous Fourier 
analysis as an approximation tool. 


Exercises 


Throughout the exercises, we fix an integer b > 1 and let € = e?7%/°. 


7.1. Show that 1-2? = iemere! — €¥z). 
7.2. d Show that []j~;(1— €*) = b. 


7.3. Consider the matrix that arose in the proof of Theorem 7.2, namely 
L = (aij), with aj; := e0=-DG-1) and with 1 < i,j < b. Show that the matrix 
Re is a unitary matrix (recall that a matrix U is unitary if U*U = I, where 
U* is the conjugate transpose of U). Thus, this exercise shows that the Fourier 


transform of a periodic function is always given by a unitary transformation. 


vb 
number z. (It turns out that det(Z) can sometimes be a complex number, but 
we will not use this fact here.) 


7.4. Show that [det (42)| = 1, where |z| denotes the norm of the complex 


7.5. & Show that for every integer a relatively prime to b, 


1 ie 
5 oLKE ak rs 


7.6. & Let n be an integer. Show that the sum ba €’" vanishes, unless 
n = 0 (mod b), in which case it is equal to b. 


7.7. & For an integer m, recall that we defined the delta function 


tes 1 if*=m-+ab, for some integer a, 
a= 
vs 0 otherwise. 


The b functions 4; (a),...,6y(a) are clearly in the vector space Vg, since they 
are periodic on Z with period b. Show that they form a basis for Va. 


7.8. & Prove that for all f,g € Va: 
(a) (f, f) > 0, with equality if and only if f = 0, the zero function. 
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(b) (f,9) = (9, f)- 


7.9. Show that 7y-) par = 5. 


7.10. Show that (da, 6c) = da(c). 
7.11. Prove that (f * 6a)(x) = f(x — a). 


7.12. Prove that d, +5. = S(atc) mod b+ 


_ 2riax 
b 


7.13. Let g(x) = x —a. Prove that fo g(a) - f(z) € 


7.14. Show that the Fourier transform can have only the eigenvalues +1 


7.15. & Prove that for every real number z, ((x)) = a: (=). 


7.16. If x is not an integer, show that ae cot (r24#) = bcot (mz). 


7.17. Show that for every integer a relatively prime to b, 
atl _ 1 
Dee 
~~ (@-1E=1) 


pth 


where the sum is taken over all roots of unity € except € = 1. 


7.18. We call a root of unity e?7'*/' a primitive b' root of unity if a is 
relatively prime to b. Let ©,(a) denote the polynomial with leading coefficient 
1 and of degree $(b)! whose roots are the ¢(b) distinct primitive b'® roots 
of unity. This polynomial is known as the cyclotomic polynomial of order b. 


Show that 
[] 2e(e) = 2" -1, 
d|b 


where the product is taken over all positive divisors d of b. 


7.19. We define the Mobius function for positive integers n by 


1 ifn=1, 
p(n) = <0 if n is divisible by a square, 
(—1)* if n is square-free and has k prime divisors. 


Deduce from the previous exercise that 


,(x) = I (a = ie 


d|b 


1 6(b) := #{k € [1,b— 1] : gcd(k, b) = 1} is the Euler ¢-function. 
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7.20. Prove that for every positive integer }, 


e2tia/b _ u(b) , 
1<a<b 


gcd(a,b)=1 
the Mobius function. 
7.21. Show that for every positive odd integer k, 


k-1)(k-2 
s(1,k) = eet) a ‘ 
7.22. Show that, if b is odd, then 772} tan? (“) = b(b— 1). 
7.23. & Show that Theorem 7.10 is equivalent to the following statement: 
F(f *g) =bF(f) F(g). 


7.24. Consider the trace of the linear transformation L = (E0-DG “D), de- 
fined in (7.3). The trace of L is G(b) := =) é™ known as a Gau8 sum. 
Show that |G(b)| = Vo if b is an odd prime. 


Chapter 8 


Dedekind Sums, the Building Blocks of 
Lattice-Point Enumeration 


If things are nice there is probably a good reason why they are nice: and if you don’t know 
at least one reason for this good fortune, then you still have work to do. 


Richard Askey 


We encountered Dedekind sums in our study of finite Fourier analysis in 
Chapter 7, and we became intimately acquainted with their siblings in our 
study of the coin-exchange problem in Chapter 1. They have one shortcoming, 
however (which we shall remove): the definition of s(a,b) requires us to sum 
over b terms, which is rather slow when b = 2'°, for example. Luckily, there 
is a magical reciprocity law for the Dedekind sum s(a,b) that allows us to 
compute it in roughly log,(b) = 100 steps, in the example above. This is the 
kind of magic that saves the day when we try to enumerate lattice points in 
integral polytopes of dimension d < 4. There is an ongoing effort to extend 
these ideas to higher dimensions, but there is much room for improvement. In 
this chapter, we focus on the computational-complexity issues that arise when 
we try to compute Dedekind sums explicitly. In many ways, the Dedekind 
sums extend the notion of the greatest common divisor of two integers. 


8.1 Fourier—Dedekind Sums and the Coin-Exchange 
Problem Revisited 


Recall from Chapter 1 the Fourier—Dedekind sum (defined in (1.13)) 


b-1 kn 
b 
Sy, (@1,49,...,ag3b) = 
n ( 1,42; »“d, ) oe l hax 1 aes 1 kas ’ 
k=1 = b _ b ee a _— b 
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which appeared as a main player in our analysis of the Frobenius coin- 
exchange problem. We can now recognize the Fourier-Dedekind sums as 
honest finite Fourier series with period b. The Fourier-Dedekind sums unify 
many variations of the Dedekind sum that have appeared in the literature, and 
form the building blocks of Ehrhart quasipolynomials. For example, we showed 
in Chapter 1 that s,, (a1, @2,...,@a;0) appears in the Ehrhart quasipolynomial 
of the d-simplex 


{ (a1, sss, 2d41) € Rg 2 4%, +--+ + aqtat+ bta41 = i} . 
Example 8.1. We first notice that when n = 0 and d = 2, the Fourier-— 


Dedekind sum reduces to a classical Dedekind sum (which—finally—explains 
the name): for relatively prime positive integers a and b, 


so(a, 1; b) = 


1 1 1 1 1 
' 26-< 1—@ 7 oe - ae 


1 14eh\ (1424 1 p= 
5 (E8)(E a) 18 ea 


In the last step, we used the fact that multiplying the index k by a does 
not change the middle sum. This middle sum can be further simplified by 
recalling (1.8): 


D2 cara. kn ee 2b’ 


whence 


—1 . A 
ex. il 1+Eh*)\ (1+& 1 1 6-1 
Hoa t ae) pe (35) (= +37 3B a 


— tka tk b-1 


Example 8.2. The next special evaluation of a Fourier-Dedekind sum is 
very similar to the computation above, so we leave it to the reader to prove 
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(Exercise 8.7) that for a1, a2 relatively prime to b, 


b-1 


ab? (8.2) 


80 (a1, a2;b) = —s (a,a3*,b) + 


1 


dg =1 mod b. 


where az 


Returning to the general Fourier—Dedekind sum, we now prove the first of 
a series of reciprocity laws: identities for certain sums of Fourier—Dedekind 
sums. We first recall how these sums came up in Chapter 1, namely, from the 
partial fraction expansion of the function 


1 
P(2) (1 — 2%)--- (1 — 244) 2” 
Aj Apo An, By Bo Ba 
ss EE iy SE ae | 8. 
, oe OG ent ete (z—1)¢ 48) 
a,—1 az—1 ag—1 
Cie Con Cak 
oe ae 
k=1 a1 k=1 ae k=1 aa 
(Here we assume that a1, @2,...,@q are pairwise relatively prime.) Theorem 1.8 


states that with the help of the partial fraction coefficients B,,..., By and 
Fourier—Dedekind sums, we can compute the restricted partition function for 
A = {a1,02,...,@a}: 


pa(n) = —B, + By — +++ + (-1)*Ba t+ 8—n (a2, 43,-.., a} 41) 
+ Sn (41,03, @4,..-,@d} 42) +++ + $n (G1, 2,-..,@a—13 Ga) - 
It is not hard to see that B,, Bo,..., Ba are polynomials in n (Exercise 8.8), 


whence we call 
poly 4(n) := —B, + By —---4+ (—1)"Ba 


the polynomial part of the restricted partition function p4(n). These poly- 
nomials can be computed from first principles (and we give some of them 
in Example 8.3) but in fact, they are relatives of the Bernoulli polynomials, 
which we defined in (2.8) in Section 2.4: We define the Bernoulli—Barnes 
polynomials B(x) through the generating function 


gd et2 gk 
= BA uae 
(e%1% — 1) (e227 — 1)--- (e%4* — 1) » i (@) ki? 
where as usual, A = {a1,da2,...,aa}. (The Bernoulli polynomials are the 
special cases BY) Then 
—1)4 
poly a(n) = -Y— pan), (8.4) 
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an identity that can be proved most easily with a pinch of complex analysis 
(Exercise 14.3 in Chapter 14). 


Example 8.3. The first few expressions for polys,, .a,}(™) are 


poly s,,}(m) = ae 
POY (a ,a2}(%) = — +3(- + ~) 
1,42} ajag 2\a, ag)’ 
n? n 1 1 1 
POLY fa ,a2,as}(") 7 2ajaga3.— 2 (+ ay,a3 _.) i8-5) 

rg(eta+se t+ e+), 
12 \a, a2 a3 aga3 aja3”)— aya 
n3 

POLY fa, a9,03,04}(") = Gnas 

= ( 1 | 1 ) 
4 \ ajaa3 a4a2a4 a10304 420304 
n 1 1 1 1 1 1 

"A (— aja3. «@4a4—g03—S 24, =.) 

on ( ay a2 az | ) 
12 \aga3a4 944344 9 ay aga4 a1a2a3 

Fe. (= ay ay az, a2 , a 
24 \aga3 dod, 9 =9a3d4— a 3 ayaa a3a4 
a3 a3 a3 a4 a4 aA ) 

a142  @\44 + 42Q4 «=a1a2 «-a4a3.—SsA2. 43 

1 ( 1 | 1 1 % ~) 
8 \ar ag az a4 


We are about to combine the Ehrhart results of Chapter 3 with the partial 
fraction expansion of Chapter 1 that gave rise to the Fourier—Dedekind sums. 


Theorem 8.4 (Zagier reciprocity). For all pairwise relatively prime posi- 


tive integers a1,Q2,...,@d; 
80 (a2, 43,.--,@a}@1) + 80 (41, 43, @4,.-., Aa} 2) +°°- 
+ So (41, @2,---,@a—1; Ga) 


=1= POLY peices, cag} O) 


At first sight, this reciprocity law should come as a surprise. The Fourier— 
Dedekind sums can be long, complicated sums, yet when combined in this 
fashion, they add up to a trivial rational function in aj, a2,..., dq. 


Proof. We compute the constant term of the quasipolynomial p(n): 
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pa(0) = poly 4(0) + So (a2, a3,.--, da} @1) 
+ sq (a1, 43, 04,...,@g;G2) +++: +89 (a1, G2,...,@d_1; Ga) - 


On the other hand, Exercise 3.32 (the extension of Corollary 3.15 to Ehrhart 
quasipolynomials) states that p4(0) = 1, whence 


1 = poly 4 (0) + 50 (a2, a3,...,@a3 1) 
+ 89 (@1, 43, Q4,..-,@a}@2) + +++ + 89 (Q1, G2,..-,@d—1; Ga) - 


8.2 The Dedekind Sum and Its Reciprocity and 
Computational Complexity 


We derived in (8.1) the classical Dedekind sum s(a, b) as a special evaluation 
of the Fourier-Dedekind sum. Naturally, Theorem 8.4 takes on a particular 
form when we specialize this reciprocity law to the classical Dedekind sum. 


Corollary 8.5 (Dedekind’s reciprocity law). For all relatively prime pos- 
itive integers a and b, 


l1/a b 1 1 
s(a,b) + s(b, a) = 75 (G+ +5) Zz 


a 
Proof. A special case of Theorem 8.4 is 
So(a, 1; b) + so(b, a; 1) + so(1, b; a) =1- POly 44.1.5} (0) 


1 (3 3 a 1b 
=1 5 (24345453 ! ) 


3 1fa,b,1) 1.1 
~4 12\b a ab 4a  4b° 


Now we use the fact that s9(b,a;1) = 0 and the identity (8.1): 


1 1 
so(a, 1;b) = —s(a,b) + 17 BB 


Dedekind’s reciprocity law allows us to compute the Dedekind sum s(a, b) 
as quickly as the gcd algorithm for a and b, also known as the Euclidean 
algorithm. Let’s get a better feeling for the way we can compute the Dedekind 
sum by working out an example. We remind the reader of another crucial 
property of the Dedekind sums that we already pointed out in (7.5): s(a, b) 
remains invariant when we replace a by its residue modulo 0, that is, 


s(a, b) = s(a mod 8, b). (8.6) 
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Example 8.6. Let a = 100 and b = 147. Now we alternately use Corollary 8.5 
and the reduction identity (8.6): 


1 (100 147 1 1 
s(100, 147) = +, (= Te mn) 7 7 8(147, 100) 
1249 
1249 t /47..100. 4 1 
= 100, 4 
17640 (3 (= 47 +a) a 7) 
(a6 
= ~aq7a7 + 506.47) 
773 1/6 47 1 1 
- 4. 3(4 
20727 | 12 & 6 + aa) gee”) 
166 
= WA — (5, 6) 
166 1/5 6 1 1 
~ 441 (S(gtEts) 4 5(6:5)) 
2003 
4410 
_ 2003 1,1) 5 
~ 4410 4° 30° 12 
_ 5 
~ 882° 
In the last step, we used Exercise 7.21: s(1,k) = —t + - + +. A priori, 


s(100, 147) takes 147 steps to compute, whereas we were able to compute it 
in nine steps using Dedekind’s reciprocity law and (8.6). 


As a second corollary to Theorem 8.4, we mention the following three- 
term reciprocity law for the special Fourier—Dedekind sum s0(a, b;c). This 
reciprocity law could be restated in terms of the classical Dedekind sum via 
the identity (8.2). 


Corollary 8.7. For pairwise relatively prime positive integers a, b, and c, 


1/3 3 3 344 b Cc 
so(a, b; c) + so(c, a; b) + 80(b, 5a) = 1 13 (: a es er <). 


8.3 Rademacher Reciprocity for the Fourier—Dedekind 
Sum 


The next reciprocity law will be again for the general Fourier-Dedekind sums. 
It extends Theorem 8.4 beyond n = 0. 
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Theorem 8.8 (Rademacher reciprocity). Let a),a2,...,aq be pairwise 
relatively prime positive integers. Then for n = 1,2,...,(a1 +---+aa—1), 
Sn (@2,43,..-,Qd; 41) + Sn (Q1, 43, 04,-.-, Aa; 2) + °°: 
+ Sn (G1, G2,---,@d—13@a) = — Poly ga, ap,...,ag}(—”) - 


Proof. We recall the definition 
p(n) = #{(mi,...,ma) EZ: all m,; >0, mya, +--+ + maaa =n} 


of Exercise 1.33, that is, p%(m) counts the number of partitions of n using 
only the elements of A as parts, where each part is used at least once. This 
counting function is, naturally, connected to p4 through Ehrhart—Macdonald 
reciprocity (Theorem 4.1): 


that is, 


(—1)*"'p4(n) =poly 4(—n) + Sn (a2, a3,.-., 4a} a1) 
+ Sy (@1,43,4,...,@a} G2) ++++ + Sy (G1, G2,...,d—13 Aa) - 


On the other hand, by its very definition, 
py(n) =0 for n=1,2,...,(a, +---+ag—1), 
so that for those n, 


0 = poly 4(—n) + Sn (a2, a3,-.-,@a; a1) 


+ Sn (4,03, @a,.+.40g) 9) +-+++ 8, (01,09, ...,0g—13 Gg) « 


Just as Zagier reciprocity takes on a special form for the classical Dedekind 
sum, Rademacher reciprocity specializes for d = 2 to a reciprocity identity for 
the Dedekind—Rademacher sum 


The classical Dedekind sum is, naturally, the specialization ro(a, b) = s(a, b). 
To be able to state the reciprocity law for the Dedekind—Rademacher sums, 
we define the function 
1 if aln, 
Xaln) = | 


0 otherwise, 


which will come in handy as a bookkeeping device. 
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Corollary 8.9 (Reciprocity law for Dedekind—Rademacher sums). 
Let a and b be relatively prime positive integers. Then for n = 1,2,...,a+, 


n? nf 1 1 1 fab 1 
n 16 n b, = 
rm(a,b) + rn(b, a) 2ab 2 (= a 5) 12 (F a 7 =) 


+3((S2))+ (2) +@+@) 


1 
3 
1 
7 (1 + xa(n) + x0(n)), 


where a~'a =1 mod b and b-!b =1 mod a. 


This identity follows almost instantly once we are able to express the 
Dedekind—Rademacher sum in terms of Fourier-Dedekind sums. 


Lemma 8.10. Suppose a and b are relatively prime positive integers and 
n€Z. Then 


niami=-ntonied (2) +4 (S5)-2-4me, 


where a~'a = 1 mod b. 


Proof. We begin by rewriting the finite Fourier series (1.8) for the sawtooth 
function ((x)): 


Hence we also have 


b-1 


1 gkn ka-'n -1 1 7 1 
7 ka eae (7) + 5x00 "a Ge 


“(fob 


Now we use the convolution theorem for finite Fourier series (Theorem 7.10) 
for the functions 


1 b-— 1 b-1 gkn 
— — b 
: ere a and g(n) = 5 ae 


& k=1 


It gives 
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b-1 kn b-1 


1 
5B Oe) mam) = 


(C8) +Ee—m-8) (C2) do). 


We invite the reader to check (Exercise 8.12) that the sum on the right-hand 
side simplifies to 


nied = rune} (G2)) +3 (B))- Rrra 


Proof of Corollary 8.9. We use the following special case of Theorem 8.8: 
8n(a, 1;b) + 8,(1, a; b) + 8,(a,b; 1) = — poly ¢4.4,43(—”) 
we ca ga ty, de Dy anes eee ee 
~ Qab' 2\ab ab 12\a ‘bd b' a. ab)’ 


which holds for n = 1,2,...,a+6. Lemma 8.10 allows us to translate this 
identity into one for Dedekind—Rademacher sums: 


n? nfl 1 1 1 fab 1 
n ,b n(b, = t t t 
rm(a,6) + rn(b, a) 2ab 2 (5 a ;) 12 (5 +243) 


(CS) (C2) +@)+@) 


(1+ xa(n) + xs(n)). 


4 


The two-term reciprocity law allows us to compute the Dedekind— 
Rademacher sum as quickly as the gcd algorithm, just as obtained for the 
classical Dedekind sum. This fact has an interesting consequence: In Theo- 
rem 2.10 and Exercise 2.36, we showed implicitly (see Exercise 8.13) that 
Dedekind—Rademacher sums are the only nontrivial ingredients of the Ehrhart 
quasipolynomials of rational polygons. Corollary 8.9 ensures that these Ehrhart 
quasipolynomials can be computed almost instantly. 
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8.4 The Mordell—Pommersheim Tetrahedron 


In this section, we return to Ehrhart polynomials and illustrate how Dedekind 
sums appear naturally in generating-function computations. We will study 
the tetrahedron that historically first gave rise to the connection of Dedekind 
sums and lattice-point enumeration in polytopes. It is given by 


3 GY z 
— : > — a -< : 
P {(z,y,2) €R 2,y,z > 0, +942 <i}, (8.7) 


a tetrahedron with vertices (0,0,0), (a,0,0), (0, 6,0), and (0,0,c), where a, b,c 
are positive integers. We insert the slack variable n and interpret 


k ol 
Lp(t) = # { (ktm) €Z?:k.im>0, —+ 5 dee < i} 
a c 
a # {(k, l,m, n) €Z*:k,l,m,n>0, bck + acl +abm+n= abct } 
as the Taylor coefficient of 2°“ for the function 


> zbck » gael . zobm . Pi 


k>0 1>0 m>0 n>0 
1 
(2) 0 — 9) 0) (Ta) 


As we have done numerous times before, we shift this coefficient to the constant 
term: 


1 
Lp(t) = const € — gb) (1 — 2°) (1 — 2) (1 —2) =) ; 


To reduce the number of poles, it is convenient to change this function slightly; 
the constant term of 1/(1 — z°°)(1 — z2°)(1 — z%°)(1 — z) is 1, so that 


abet —] 


0 yaa) 


Lp(t) = const ( 


This trick becomes useful in the next step, namely expanding the function into 
partial fractions. Strictly speaking, we cannot do that, since the numerator 
is not a polynomial in z. However, we can think of this rational function as 
a sum of two functions. The higher-order poles of both summands that we 
will not include in our computation below cancel each other, so we can ignore 
them at this stage. The only poles of 


gy abet —1 


(1 — 2°) (1 — 22°) (1 — 2°) (1-2) 


(8.8) 
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are at the a*®, bt», ct” roots of unity and at 0. (As before, we do not have to 
bother with the coefficients of z = 0 of the partial fraction expansion.) To 
make life momentarily easier (the general case is the subject of Exercise 8.18), 
let’s assume that a, b, and c are pairwise relatively prime; then all the poles 
besides 0 and 1 are simple. The computation of the coefficients for z = 1 is 
very similar to what we did with the restricted partition function in Chapter 1. 
The coefficient in the partial fraction expansion of a nontrivial root of unity, 
say €*, is also computed practically as easily as in earlier examples: it is 


i 
oiler) (l= ch) 


(see Exercise 8.15). Summing this fraction over k = 1,2,...,a—1 gives rise 
to the Fourier—-Dedekind sum 


(8.9) 


po 1 
a 


(1 — eke) (1 — EF) 


t so (bc, 1; a). 
k=1 


Putting this coefficient and its siblings for the other roots of unity into the 
partial fraction expansion and computing the constant term yields (Exer- 
cise 8.15) 


st 


Ge Nant Se i= Lop ae eee, F 
4 4A\a' bec wWla'b §e¢ abc 


vv (so (be, 1; a) + 89 (ca, 1; b) + 80 (ab, 1; c))t 
+1. 


Lp(t) = 


We recognize instantly that the Fourier-Dedekind sums in this Ehrhart 
polynomial are in fact classical Dedekind sums by (8.1), and so we arrive at 
the following celebrated result. 


Theorem 8.11. Let P be given by (8.7) with a, b, and c pairwise relatively 
prime. Then 


iplt) = 8 , racket hp ! € | avoye 
1 /be ca ab 1 
+T5 (= a + a =z) s(Be,a) — 8 (ab) ~ (abye) ) O41. 


We finish this chapter by giving the Ehrhart series of the Mordell-Pommers- 
heim tetrahedron P. It follows simply from the transformation formulas 
(computing the Ehrhart numerator coefficients from the Ehrhart polynomial 
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coefficients) of Corollary 3.16 and Exercise 3.15, and hence the Ehrhart series 
of P naturally contains Dedekind sums. 


Corollary 8.12. Let P be given by (8.7) with a, b, and c pairwise relatively 


prime. Then 
_ AB + hg 22+ hi z+1 


Ehrp(z) = (2) ; 
where 
pe — abe ab+act+tbct+a+bt+e 1, 1 (=+F ab 1 ) 
? 6 4 2° 12\ 4 b °c | abe 
— s(bc,a) — s (ca, b) — s (ab, c) 
ng = AE Se oT (S tS eS) 
3 2 2 6\a b c abe 
— 2(s (bc, a) + s (ca, b) + s (ab, c)) 
pe — abe , abtactbe+atbte 245 (% e++e) 
. 6 4 EN bc abe 


— s(bc,a) — s(ca,b) — s(ab,c). 


It is a curious fact that the above expressions for hj, h3, and h3 are 
nonnegative integers due to Corollary 3.11. 


Notes 


1. The classical Dedekind sums came to life in the 1880s when Richard 
Dedekind (1831-1916)! studied the transformation properties of the Dedekind 


n-function [99] . 
n(z) =e 1? II (1-7) | 


a useful computational gadget in the land of modular forms in number theory. 
Dedekind’s reciprocity law (Corollary 8.5) follows from one of the functional 
transformation identities for 7. Dedekind also proved that 


12k s(h,k) =k+1—-2 (7) (mod 8), 


establishing a beautiful connection between the Dedekind sum and the Jacobi 
symbol (4) (the reader may want to consult the lovely Carus monograph 
Dedekind Sums, by Emil Grosswald and Hans Rademacher, where the above 
result is proved [199, p. 34]), and then used this identity to show that the 


1 For more information about Dedekind, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Dedekind. html. 
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reciprocity law for the Dedekind sums (for which [199] contains several different 
proofs) is equivalent to the reciprocity law for the Jacobi symbol. 


2. The Dedekind sums and their generalizations appear in various contexts 
besides analytic number theory and discrete geometry. Other mathematical 
areas in which Dedekind sums show up include topology [140, 176, 255], 
algebraic number theory [174,222], and algebraic geometry [117]. They also 
have connections to algorithmic complexity [153] and continued fractions 
(17, 136, 183]. 


3. The reciprocity laws (Theorems 8.4 and 8.8) for the Fourier-Dedekind 
sums were proved in [37], though the connection to Bernoulli-Barnes poly- 
nomials was noticed only in [27]. Theorem 8.4 is equivalent to the reci- 
procity law for Don Zagier’s higher-dimensional Dedekind sums [255]. Corol- 
lary 8.7 (stated in terms of the classical Dedekind sum) is originally due 
to Hans Rademacher [197]. Theorem 8.8 generalizes reciprocity laws by 
Rademacher [198] (essentially Corollary 8.9) and Ira Gessel [119]. 


4. The Fourier—Dedekind sums form only one set of generalizations of the 
classical Dedekind sums. A long, but by no means complete, list of other 
generalizations is [8,9, 28,31, 33, 39, 51-53, 80, 81, 107, 108, 119, 128, 130, 152, 
174-176, 198, 241, 255]. 


5. The connection of Dedekind sums and lattice-point problems, namely 
Theorem 8.11 for ¢ = 1, was first established by Louis Mordell in 1951 [181]. 
Some 42 years later, James Pommersheim established a proof of Theorem 8.11 
as part of a much more general machinery [193]. In fact, Pommersheim’s work 
implies that the classical Dedekind sum is the only nontrivial ingredient one 
needs for Ehrhart polynomials in dimensions 3 and 4. 


6. We touched the question of efficient computability of Ehrhart (quasi-)po- 
lynomials in this chapter. Unfortunately, our current state of knowledge on 
generalized Dedekind sums does not suffice to make any general statement. 
However, Alexander Barvinok proved in 1994 [24] that in fixed dimension, 
the rational generating function of the Ehrhart quasipolynomial of a ratio- 
nal polytope can be efficiently computed. Barvinok’s proof did not employ 
Dedekind sums but rather used a decomposition theorem of Brion, which is 
the subject of Chapter 11. 


Exercises 


8.1. Show that s(a,b) = 0 if and only if a? = —1 mod b. 


162 8 Dedekind Sums 


8.2. Prove that 6b s(a,b) € Z. (Hint: Start with rewriting the Dedekind sum 
in terms of the greatest-integer function.) 


8.3. Prove that s(—a,b) = —s(a,b), for all a € Z with gcd(a, b) = 1. 


8.4. For a fixed prime modulus p, show that we can solve s(a,p) = s(b,p) in 
integers a,b if and only if a= b mod p or ab=1 mod p. 


8.5. Let a and b be any two relatively prime positive integers. Show that the 
reciprocity law for the Dedekind sums implies that for b= r mod a, 


12ab s(a, b) = —12ab s(r,a) + a? +b? — 3ab +1. 


Deduce the following identities: 


(a) For b= 1 moda, 


12ab s(a,b) = —a7b +b? +a? — 2b+1. 


(b) For b = 2 mod a, 


1 
12ab s(a,b) = 5a b ba? +b? ——b+1. 
(c) For b= —1 mod a, 
12abs(a, b) = a2b + a? + b? — Gab + 2b +1. 


8.6. Denote by f,, the sequence of Fibonacci numbers, defined as in Chapter 1 
by 
fi=fe=1 and fn42 = fn4it fn forn > 1. 


Prove that 
8 (fox, for+1) = 0 


and 
12 fori fox $ (for—1, for) = f3n—1 + fon — 3fox—1 far +1. 


8.7. de Prove (8.2): 


b-—1 
$0 (a1, a9; b) == 5 (a,az*,b) + Ab ) 


where a3 ‘a2 = 1 mod b. 


8.8. Prove that B,, Bo,..., By in the partial fraction expansion (8.3), 
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1 
f(z) = (1 — 2%)... (1 — 244) 2m 
_A _ A2 wirtents ted By, i By Peeacdt Ba 
Zz z zr z—-1 (z-1)? (z-1)4 
ai—1 ag-1 aq—-l 
Cik Cok Cak 
+ eo ee Or SS 
k=1 ay k=1 a2 k=1 Qa 
are polynomials in n (of degree less than d) and rational functions in aj,..., aa. 


8.9. & Verify the first few expressions for polys,, 4,}(m) in (8.5). 


8.10. Show that the Dedekind—Rademacher sum satisfies r_,(a, b) = rn (a, b). 


Faia 


8.11. The definition s(a,b) := i ((4*)) ((£)) of the Dedekind sum may 
be extended to encompass all pairs of integers (a,b) with b > 0. Show that if 
g := gcd(a, 6), then s(a,b) = (4, °). (This exercise shows that the Dedekind 
sum may also be thought of as a function of the fraction ¢, defined for any 


two integers a and b > 0.) 


8.12. & Show that 


E (((52)) +2e-- 3) (52) +203) 


m=0 


8.13. Rephrase the Ehrhart quasipolynomials for rational triangles given in 
Theorem 2.10 and Exercise 2.36 in terms of Dedekind—Rademacher sums. 


8.14. Rephrase the restricted partition piqy,-(m) from Example 1.9 in terms 
of Dedekind—Rademacher sums, and use an explicit formula for pyg.1,1}(m) to 
derive a formula for r,,(1, a). 


8.15. & Prove Theorem 8.11 by verifying (8.9) and computing the coefficients 
for z = 1 in the partial fraction expansion of (8.8). 


8.16. Let P be given by (8.7) with a = k and b= c= 1. Compute Lp(t) from 
first principles and conclude from this an alternative proof of the formula for 
s(1,%) given in Exercise 7.21. 


8.17. Let P be given by (8.7) with gcd(a,b) = 1 and c = 1. Use Dedekind’s 
reciprocity law (Corollary 8.5) to derive from Theorem 8.11 a formula for 
L(t) not depending on Dedekind sums. Can you prove this formula from 
first principles? 
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8.18. Generalize the Ehrhart polynomial of the Mordell-Pommersheim tetra- 
hedron given in Theorem 8.11 to the case that a, b, and c are not necessarily 
pairwise relatively prime. 


8.19. Compute the Ehrhart polynomial of the 4-simplex 


ZY X2 x3 TA 
£1, %2,23,24) € R4,: + 21), 
{( 1 2,43 4) >0 a T b Z T d- 


where a, b,c,d are pairwise relatively prime positive integers. (Hint: You may 
use Corollary 5.5 to compute the linear term.) 


8.20. This exercise gives an alternative proof (and extension) of Dedekind’s 
reciprocity law (Corollary 8.5) by way of a polynomial analogue of the 
Dedekind sum, the Carlitz polynomial 


b-1 
c(u, v; a, b) = S- ule] yk-1, 
k=1 
Here u and v are indeterminates, and a and Db are positive integers. 
(a) Continuing Exercise 3.7, express the integer-point transforms of the ratio- 
nal cones 
Ky = {\1(0, 1) a A2(a, b) i Ai, r2 > 0} 
Ko = {\1(1, 0) + A2(a, b) : A> 0, A2 > 0} 


in terms of Carlitz polynomials. Here a and 0 are relatively prime positive 
integers. 
(b) Verify that R2, is the disjoint union of K, and Ke, and use this fact to 


prove the Carlitz reciprocity law 
(v _ 1) c(u, U;a, b) + (u = 1) c(v, U; b, a) = ys tye-t =. 


(c) Apply the operators u Ou twice and v Ov once to Carlitz’s reciprocity law 
to deduce (once more) Corollary 8.5. 


Open Problems 


8.21. Find new relations between various Dedekind sums. 


8.22. It is known [31] that the Fourier-Dedekind sums are efficiently com- 
putable. Find a fast algorithm that can be implemented in practice. 


8.23. For a fixed integer modulus c, find all integer solutions a,b € Z to 
s(a,c) = (b,c). (See Exercise 8.4 for the case that c is prime.) 
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8.24. What is 
{s(a,b): a,bE Z, b> O}, 


the set of all rational numbers that occur as Dedekind sums? (It is known 
that this set is dense in R [17,136] and that every rational number between 
0 and 1 occurs as the fractional part of a Dedekind sum [120]. On the other 
hand, we know that 12 s(a,b) is never an integer unless s(a,b) = 0 [129].) 


Chapter 9 
Zonotopes 


“And what is the use of a book,” thought Alice, “without pictures or conversations?” 


Lewis Carroll (Alice in Wonderland) 


We have seen that the discrete volume of a general integral polytope may be 
quite difficult to compute. It is therefore useful to have an infinite class of 
integral polytopes whose discrete volume is more tractable, and yet they are 
robust enough to be “closer” in complexity to generic integral polytopes. One 
initial class of more tractable polytopes is that of parallelepipeds, and as we 
will see in Lemma 9.2, the Ehrhart polynomial of a d-dimensional half-open 
integer parallelepiped P is equal to vol(P) t?. In this chapter, we generalize 
parallelepipeds to projections of cubes. 


9.1 Definitions and Examples 


In order to extend the notion of a parallelepiped, we begin by defining the 
Minkowski sum of the polytopes P),P2,...,P» CR? as 


Pit Po+-+: + Ph t= {x1 + Xe t++-+Xn: x; © Pi}. 


For example, if P; is the rectangle [0,2] x [0,1] C R? and Pz is the line 
segment aye (3) C R?, then their Minkowski sum P,; + P2 is the hexagon 
whose vertices are (i. (als i (3). (ay and Gis as depicted in Figure 9.1. 
Parallelepipeds are special instances of Minkowski sums, namely those of line 
segments whose direction vectors are linearly independent, plus a point. 

We will also make use of the following handy construct: given a polynomial 
p(21, 22,---, 2a) in d variables, the Newton polytope N(p(z1, z2,.--,2a)) 
of p(21, 22,---, Za) is the convex hull of all exponent vectors appearing in the 
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Fig. 9.1 The Minkowski sum ([0, 2] x [0,1]) + ee ele 


nonzero terms of p(21, 22,---, 2a). For example, the hexagon in Figure 9.1 can 
be written as 


N (1 + 327 — eo — Sted + 342729 + wees) . 


It turns out (Exercise 9.1) that the constructions of Newton polytopes 
and Minkowski sums are intimately related. Namely, if p(z1, 22,..., 2a) and 
q(21, 22,---,2a) are polynomials, then 


N (p(a1, 22, 6 085 2a) Q(21; 205.4%, Za)) 
= N (p(a1, 2,205 za)) +N (q(a1, 22,205 za)) ¢ (9.1) 
Suppose that we are now given n line segments in R%, such that each line 
segment has one endpoint at the origin and the other endpoint is located at 


the vector uj; € R¢, for 7 =1,...,n. Then by definition, the Minkowski sum 
of these n segments is 


Z(u1, Ug,...,Un) = {x + xot-:-+x,: Xj = Aju; with Aj E (0, 1]}. 


Fig. 9.2 The zono- 


tope Z (4), (3), ())—a 
hexagon. 


Let’s rewrite the definition above in matrix form: 
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Z(U1,Ug,...,Un) = {Aiui + Agua + +++ +AnUn: O< A; < 1} 
Ai 
A2 
= (uy, U2 --: Up) 5 :0<A; <1 
Xn 
= A(0, 1)”, 


where A is the (dx n) matrix whose j‘" column is u,;. Just a bit more generally, 
a zonotope is defined to be any translate of A [0,1]”, ie., 


A(0,1]" +b, 


for some vector b € R¢. So we now have two equivalent definitions of a 
zonotope—the first is as a Minkowski sum of line segments, while the second 
is as a projection of the unit cube [0,1]”. Figures 9.2-9.4 show example 
zonotopes. 


Fig. 9.3 The rhombic do- 
decahedron is a zonotope. 


It is sometimes useful to translate a zonotope so that the origin becomes 
its new center of mass. To this end, we now dilate the matrix A by a factor 
of 2 and translate the resulting image so that its new center of mass is at the 
origin. Precisely, 


2A [0,1]" — (uy, +--+ + u,) 
= {2uj\, +--+» + 2u,An — (ur +--+ + un): 0<A; < 1} 
= A[-1,1]", (9.2) 
where the last step holds because —1 < 2A; -1 <1 when 0 < A; <1. In 


other words, (9.2) is a linear image of the larger cube [—1, 1]", centered at 
the origin. We thus define 
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2Z(tu;,+ug,...,tu,) := A[—1,1]”, 


and we see that Z(+u,,+up2,...,-+u,) is indeed a zonotope, by definition. 
We say that a polytope P is symmetric about the origin when it has the 
property that x € P if and only if —x € P. In general, a polytope P is called 
centrally symmetric if we can translate P by some vector b such that P+b 
is symmetric about the origin. The zonotope Z(+u,,+ue,...,+u,,) is an 
example of a polytope that is symmetric about the origin. We go through the 
full argument here, because it justifies our choice of a symmetric representation: 
Choose x € Z(+u1,+ug,...,+u,,), so that x = Ay, with y € [—1,1]”. Since 
—y € [-1,1]” as well, 


—x = A(-y) € Z2(4uj,+ug,...,+u,). 


It is, moreover, true that each face of a zonotope is again a zonotope and that 
therefore, every face of a zonotope is centrally symmetric (Exercise 9.2). 


Fig. 9.4 A few more zonotopes. 


9.2 Paving a Zonotope 


Next we will show that every zonotope can be neatly decomposed into a disjoint 
union of half-open parallelepipeds. Suppose that w1,w2,...,Wm € R@ are 
linearly independent, and let o1,02,...,0m € {+1}. Then we define 


ae = Pv + A2W2 ae eerse AmWm : : - Ay < 7 e ! \ . 


O<A; <1lifo;=1 

In plain English, Wyog;7°w.,, is a half-open parallelepiped generated by 
W 1, W2,---,Wm, and the signs 01,09,...,0m keep track of those facets of 
the parallelepiped that are included or excluded from the closure of the 
parallelepiped. 
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Lemma 9.1. The zonotope Z(uj,U2,...Un,) can be written as a disjoint 


union of translates of IQR; 7m, where {wi,W2,--.,Wm} ranges over 
all linearly independent subsets of {u,,U2,...U,}, each equipped with an 
appropriate choice of signs 01,02,...,;Om- 


Figure 9.5 illustrates the decomposition of a zonotope as suggested by 
Lemma 9.1. 


Proof of Lemma 9.1. We proceed by induction on n. If n = 1, Z(u1) is a line 
segment and 0 U (0, uj] is a desired decomposition. 
For general n > 1, we have by induction the decomposition 


Z(uW1,Ug,.-.,Un—-1) =I, UIbU::- UT, 


into half-open parallelepipeds of the form given in the statement of Lemma 9.1. 
Now we define the hyperplane H := {x € R¢: x-u, = 0} and let 7: R? > H 
denote the orthogonal projection onto H. Then 7(uz), 7(uz),...,7(Un—1) are 
line segments or points, and thus Z(7(uj),7(ug),...,7(Un_—1)) is a zonotope 
living in H. Once more by induction, we can decompose 


Z(m(u1), 7(u2),...,7(Un—1)) = ®, U®2,U ---U®,, 


into half-open parallelepipeds of the form given in Lemma 9.1. Each ®; is a 
half-open parallelepiped generated by some of the vectors 7(u1), (U2), .-., 
7™(Un—1); let 0; denote the corresponding parallelepiped generated by their 
unprojected counterparts. Then (Exercise 9.5) the desired disjoint union of 
Z(uj,Ug,...,U,) is given by I]; UI, U--- UH, UP, UP. U---U Pm, where 
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P; = ®; x (0, up). 


This decomposition lemma is useful, e.g., to compute the Ehrhart polyno- 
mial of a zonotope. To this end, we now first work out the Ehrhart polynomial 
of a half-open parallelepiped, which is itself a (particularly simple) zonotope. 


Lemma 9.2. Suppose w),W2,...,wq € Z® are linearly independent, and let 
II := {Ar wi + Agw2 +--+ AaWa : 0< 1, AQ,.++,Adq < 1}. 


Then 
# (11M Z*) = vol II = |det (wi,...,wa)|, 


and for every positive integer t, 
# (tILN Z*) = (vol II) ¢*. 


In other words, for the half-open parallelepiped II, the discrete volume 
# (tI1N Z*) coincides with the continuous volume (vol IT) t?. 


Proof. Because II is half open, we can tile the t'® dilate tH by t@ translates 
of II, and hence 


In@)=# ((INZ) =# (NZ). 


On the other hand, by the results of Chapter 3, L(t) is a polynomial with 
leading coefficient vol II = |det (wi,...,wa)|. Since we have equality of these 
polynomials for all positive integers t, it follows that 


# (IIN Z*) = vol I. 


Our proof shows that Lemma 9.2 remains true if we switch the < and < 
inequalities for some of the \, in the definition of I, a fact that we will use 
freely, e.g., in the following tool to compute Ehrhart polynomials of zonotopes, 
which follows from a combination of Lemmas 9.1 and 9.2. 


Corollary 9.3. Decompose the zonotope Z C R®¢ into half-open parallelepipeds 
according to Lemma 9.1. Then the coefficient c, of the Ehrhart polynomial 
Lz(t) =cat?+ceg_1 te 14+-++-+ 9 equals the sum of the (relative) volumes 
of the k-dimensional parallelepipeds in the decomposition of Z. 


9.3 The Permutahedron 


To illustrate that Corollary 9.3 is useful for computations, we now study a 
famous polytope, the permutahedron 


Pa := conv {(m(1) — 1, 7(2) —1,..., m(d) —1): rE Su}, 
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that is, the convex hull of (0,1,...,d—1) and all points formed by permuting its 
entries. It is not hard to show (see Exercise 9.9) that Pa is (d— 1)-dimensional 


and has d! vertices. Figures 9.6 and 9.7 show P3 and P, (the latter projected 
into R3). 


(0, 2, 1) 


(0, 1,2) 


Fig. 9.6 The permutahe- 


z (1,0, 2) 
dron P3. 


The reason that permutahedra make an appearance in this chapter is the 
following result. 


Theorem 9.4. 
Pa = [e1, €2] + [e1, 3] +--- + [eg-1, ea], 


in words: the permutahedron Pq is the Minkowski sum of the line segments 
between each pair of unit vectors in R¢. 


Proof. We make use of a matrix that already made its debut in Section 3.7: 
the permutahedron Py is the Newton polytope of the polynomial 


d— 
a : oi ; rl 
Ly Ly to 1 
det . 
d-1 ..d—2 
Ly vq tal 


One can see the vertices of Pg appearing in the exponent vectors by computing 
this determinant by cofactor expansion. Now we use Exercise 3.21: 


a 
ve : a ; x1 
Ly Ly +++ Ll 
det : : ei = ii (xj — x), 
: : ee 1<jcké<d 
d-1 | d-2 
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(1,0, 3, 2) 


(0, 1,3, 2) (1,0, 2,3) 


(0,3, 1,2) 


(0,3, 2, 1) 


(3,0, 2,1) (2,1,3,0) 


(2, 3,0, 1) 


Fig. 9.7 The permutahedron P1 (projected into R?). 


and so by (9.1), 
Pa =N (x41 — £2) +N (2x1 —2@3)+-+-+N(xq-1 — a). 


The right-hand side is the Minkowski sum of the line segments [e1, e2], [e1, es], 
sans [eq_1, eal. oO 


Now we will apply Corollary 9.3 to the special zonotope Pa. A forest is a 
graph that does not contain any closed paths. 


Theorem 9.5. The coefficient c, of the Ehrhart polynomial 
Lp,(t) = ca-1 t+ Ca—2 ae ere Co 


of the permutahedron Pq equals the number of labeled forests on d nodes with 
k edges. 


For example, we can compute Lp,(t) = 3t? + 3t +1 by looking at the 
labeled forests in Figure 9.8. 

An important ingredient to the proof of Theorem 9.5 is the following cor- 
respondence: to a subset S C {e; + e2,e1 + e3,...,ea_1 + ea} we associate 
the graph G's with node set [d] and edge set 


{jk: e; +e, € S}. 


We invite the reader to prove the following lemma (Exercise 9.13): 
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Z||/\\.A 
Nall. 


Fig. 9.8 The 3+341 
labeled forests on three 
nodes. 


Lemma 9.6. A subset S C {e; + €2,e; + €3,...,eg-1 + eg} is linearly inde- 
pendent if and only if Gg is a forest. 


Proof of Theorem 9.5. Let I consist of all nonempty linearly independent 
subsets of 
{e, + e2,e; +63,...,ea-1 + ea}. 


To each such subset S we will associate the half-open parallelepiped 


+ (0,e; +e,] 


ejte,ES 


(written as a Minkowski sum). It has relative volume 1 (Exercise 9.14). These 
half-open parallelepipeds give rise to the disjoint union 


Pa = Z(e1 +2, e1 + €3,...,e¢-1 + ea) 
=0uL{ S> Geter], (9.3) 
Sel \e;+e,€8 


as the reader should verify in Exercise 9.15. Corollary 9.3 now says that the 
coefficient c, of the Ehrhart polynomial 


Lp, (t) =ca-1t” | + ca_2t® 7 ++++ +e 


equals the sum of the relative volumes of the k-dimensional parallelepipeds in 
our zonotopal decomposition of Pq given in (9.3). As mentioned above, each 
of these volumes is 1, and Lemma 9.6 now gives the statement of Theorem 9.5. 


176 9 Zonotopes 


The leading coefficient of the Ehrhart polynomial of Pg is, of course, its 
relative volume, about which we can say slightly more. A tree is a connected 
forest. 


Corollary 9.7. The (relative) volume of the permutahedron Pq equals the 
number of labeled trees on d nodes. 


It turns out (Exercise 9.16) that there are precisely d?~? labeled trees on d 
nodes, so the relative volume of Py equals d¢?. 


Proof. By Theorem 9.5, the leading coefficient of Pg equals the number of 
labeled forests on d nodes with d— 1 edges. But (as the reader should show 
in Exercise 9.17) every such forest is connected. 


9.4 The Ehrhart Polynomial of a Zonotope 


Next we will generalize Lemma 9.2 to refine the formula of Corollary 9.3 
for Ehrhart polynomials of zonotopes. Lemma 9.2 computed the (continuous 
and discrete) volume of a half-open parallelepiped II spanned by d linearly 
independent vectors in Z7; this volume equals the number of integer points 
in II. We need a version of this lemma in which II is spanned by n linearly 
independent vectors in Z¢ where n < d. It turns out that the (continuous and 
discrete) volume of II is still given by the number of integer points in II, but 
this number is not quite as easily computed as in Lemma 9.2. 


Lemma 9.8. Suppose w1,W2,...,Wn € Z% are linearly independent, let 
II := {Arwi + A2w2 +-+-+AnWn : O< Ai, A2,.--,;An =< 1}, 


and let V be the greatest common divisor of alln x n minors of the matrix 
formed by the column vectors w,,W2,...,Wn. Then the relative volume of II 
equals V. Furthermore, 

# (IIN Z*) =V, 


and for every positive integer t, 
#EMNZ4) =V i". 


As with Lemma 9.2, the statement of Lemma 9.8 remains true if we switch 
the < and < inequalities for some of the A; in the definition of II. 


Proof. We will make use of the Smith normal form of an integer matrix (or 
more generally, a matrix with entries in an integral domain): more precisely, 
one proves in linear algebra that for every full-rank matrix A € Z™*"”, say 
for which m > n, there exist invertible matrices S € Z™*™ and T € Z"*” 
such that 
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d, 0 - 0 

0 de 
7 dn—1 0 
SAT= os a a 
0 ae 0 
0 yes 0 


with diagonal integer entries whose product did2---dn, equals the greatest 
common divisor of all n x nm minors of A. 

Geometrically, our matrices S and T transform II into the half-open 
rectangular parallelepiped 


TT := [0,d1) x [0,d2) x ++ x [0,dn) CRA 


in such a way that the integer lattice Z* is preserved; in particular, the relative 
volumes of II and II are equal, namely d;d2---d, = V. Furthermore, 


# (IN Z*) = #(fnz*) ee 


The remainder of our proof follows that of Lemma 9.2: since IT is half open, 
we can tile the t*” dilate tI by t” translates of II, and so 


# (tN Z*) = # (1 Z?*) ¢”. 


Lemma 9.8 is the crucial ingredient for the following refinement of Corol- 
lary 9.3. 


Theorem 9.9. Let Z := Z(uy,...,Un) be a zonotope generated by the integer 
vectors Uj,...,Un-. Then the Ehrhart polynomial of Z is given by 


Lz(t) = >_> mi(syel", 
8 


where S ranges over all linearly independent subsets of {u1,...,Un}, and 
m(S) is the greatest common divisor of all minors of size |S| of the matrix 
whose columns are the elements of S. 


For example, returning to the zonotope Z (CG), , (3) featured in Fig- 


3 
ure 9.5, we compute 
03 04 34 
det ({ > det ¢ 7 det € 7 


+ gcd(0, 4) t+ ged(3,3)t+ ged(4,1)t+1 
=—S7r etl. 


+ oon e 


Z2((),().()) = 
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We remark that the numbers m(S) appearing in Theorem 9.9 have ge- 
ometric meaning: if V(S) and L(S) denote respectively the vector space 
and lattice generated by S, then m(S) equals the cardinality of the group 
(V(S)NZ*) /L(S). 


Proof of Theorem 9.9. Decompose Z as described in Lemma 9.1. Corollary 9.3 
says that the coefficient c;, of the Ehrhart polynomial 


Lz(t) = Cq i? + Cqd-1 ae Spee ae Co 


equals the sum of the relative volumes of the k-dimensional parallelepipeds 
in this decomposition. Lemma 9.8 implies that these relative volumes are 
the greatest common divisors of the minors of the matrices formed by the 
generators of these parallelepipeds. 


Notes 


1. The word zonotope appears to have originated from the fact that for each 
line segment u,; that comes into the Minkowski-sum definition of a zonotope 
Z, there corresponds a “zone,” composed of all of the facets of Z that contain 
parallel translates of u;. This zone separates the zonotope into two isometric 
pieces, a “northern” hemisphere and a “southern” hemisphere, a property 
that is sometimes useful in their study. 


2. The combinatorial study of zonotopes, starting with Lemma 9.1, was 
brought to the forefront (if not initiated) by Peter McMullen and Geoffrey 
Shephard [170, 173, 219]. 


3. The permutahedron seems to have been first studied by Pieter Hendrik 
Schoute [212] in 1911. It tends to play a central role whenever one tries to 
“eeometrize” a situation involving the symmetric group. The permutahedron 
is a simple zonotope and as such quite a rare animal. It is not clear from the 
literature who first discovered Theorem 9.5, but we suspect it was Richard 
Stanley [230, Exercises 4.63 and 4.64]. There has been a recent flurry of research 
activity on generalized permutahedra (initiated by Alexander Postnikov [196]), 
which share many of the fascinating geometric and arithmetic properties of 
permutahedra. 


4. Corollary 9.3 and Theorem 9.9 are due to Stanley [228] (see also [226, 
Example 3.1] and [230, Exercise 4.31]), and we follow his proof closely in this 
chapter. The formula for the leading coefficient of the Ehrhart polynomial in 
Theorem 9.9, i.e., the volume of the zonotope, goes back to Shephard [219, 
equation (57)]. 
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5. Zonotopes have played a central role in the recently established theory 
of arithmetic matroids and arithmetic Tutte polynomials, and Corollary 9.3 
was restated in this language by Michele D’Adderio and Luca Moci [89, 
Theorem 3.2]. Ehrhart polynomials of zonotopes and their arithmetic Tutte 
siblings are in general harder to compute than what this chapter might convey; 
some examples for computable formulas (for arithmetic Tutte polynomials for 
the classical root systems) were given by Federico Ardila, Federico Castillo, 
and Michael Henley [10]. 


Exercises 


9.1. & Prove (9.1): if p(21, z2,...,2a) and q(21, 22,...,2q) are polynomials, 
then 


N (p(21, 225 +++) 2a) Q(Z1y 225 +++) Za)) 
= (p(z1, 22, i <a) +N (q(21, 22; i ~42)) : 


9.2. Prove that every face of a zonotope is a zonotope, and conclude that 
every face of a zonotope is centrally symmetric. 


9.3. Let P be a d-dimensional integral parallelepiped in R”. Prove that if 
there exists only one integer point x in the interior of P, then x must be the 
center of mass of P. 


9.4. Let P be a d-dimensional integral parallelepiped in R”. Prove that the 
convex hull of all the integer points in the interior of P is a centrally symmetric 
polytope. 


9.5. & Complete the proof of Lemma 9.1 by showing (using the notation from 
the proof) that Z(uj,u2,...,U,) equals the disjoint union of II,, He,..., I, 
and P1,P2, a » Pris 


9.6. (a) Show that for all real vectors u;,...,Un, w € R4, 
Ais s< yy —W) = Z(u,...,U,, W) — w, 


where the latter difference is defined as a translation of the zonotope 
Z(u1,...,Un, Ww) by the vector —w. 

(b) Show that given a zonotope Z, we may always choose all of the generators 
of a translate of Z in some half-space containing the origin on its boundary. 


9.7. Prove that the following statements are equivalent for a polytope P: 


(a) P is a zonotope; 
(b) every 2-dimensional face of P is a zonotope; 
(c) every 2-dimensional face of P is centrally symmetric. 
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9.8. Given a zonotope Z2(+u;, +ue,...,+u,) C R®%, consider the hyperplane 
arrangement H consisting of the n hyperplanes in R¢@ through the origin with 
normal vectors U;,U2,...,U,,. Show that there is a one-to-one correspondence 
between the vertices of Z(-+u,,+ue,...,--u,,) and the regions of H (i.e., the 
maximal connected components of R? \(JH). Can you give an analogous 
correspondence for the other faces of Z(+uj,+ug,...,+u,)? 


9.9. & Show that the dimension of the permutahedron 
Pa = conv {(a(1) — 1, 7(2) —1,...,2(d) — 1): 7 € Sa} 
is d— 1 and that P, has d! vertices. 


9.10. Show that the permutahedron Pq, is the image of the Birkhoff-von 
Neumann polytope 64 from Chapter 6 under a suitable linear map. 


9.11. Give a hyperplane description of the permutahedron Pg. 


9.12. According to Exercise 9.9, the permutahedron Py lies in a hyperplane 
H CR?*. Show that Pq tiles H. (Hint: begin by drawing the case d = 3. For 
the general case, the viewpoint of Exercise 9.8 might be useful.) 


9.13. Prove Lemma 9.6: A subset S C {e; + €2,e; +€3,...,ea-1 + ea} is 
linearly independent if and only if Gg is a forest. 


9.14. Let S be a linearly independent subset of 
{e, + e2,e; +63,...,ea-1 + ea}. 


Show that the half-open parallelepiped 


a (0,e; + ex] 


ejte,ES 
(written as a Minkowski sum) has relative volume 1. 
9.15. Prove (9.3): 
Pe=0u | J S- (0,e; + ex] 
Sel \e;+e,€S 
as a disjoint union. 
9.16. Prove that there are precisely d7~? labeled trees on d nodes. 


9.17. Show that every forest on d nodes with d— 1 edges is connected, i.e., it 
is a tree. 
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9.18. Given a graph G with node set [d], we define the graphical zonotope 
Zq as the Minkowski sum of the line segments [e;, e;] for all edges jk of G. 
(Thus the permutahedron P, is the graphical zonotope of the complete graph 
on d nodes.) Prove that the volume of Z@ equals the number of spanning 
trees of G. Give an interpretation of the Ehrhart coefficients of Zg, in analogy 
with Theorem 9.5.! 


9.19. Let G be a graph with node set [d], and let deg(j) denote the number 
of edges incident to j (the degree of the node j). The vector 


deg(G) := (deg(1), deg(2),..., deg(d)) 


is the labeled degree sequence of G. Let Zz be the zonotope generated by 
the vectors e; +e, forl <j <k<d. 


(a) Show that every labeled degree sequence of a graph with d nodes is an 
integer point in Zg. 

(b) If you know the Erdés—Gallai theorem (and if you don’t, look it up), prove 
that every integer point in Zz whose coordinate sum is even is a labeled 
degree sequence of a graph with d nodes. 


9.20. Let K be a compact, convex set in R?, symmetric about the origin, 
whose volume is greater than 2%. Prove that K must contain a nonzero integer 
point. (This result is known as Minkowski’s fundamental theorem, and it lies 
at the core of the geometry of numbers, as well as algebraic number theory. 
Its applications have shown how interesting and useful symmetric bodies can 
be.) 


Open Problems 


9.21. Classify the polynomials in Z[¢] that are Ehrhart polynomials of integral 
zonotopes. (This is hard: if the matrix defining a zonotope is unimodular, 
then its Ehrhart coefficients are the entries of the f-vector of the underlying 
matroid—see Note 5; the classification of f-vectors of matroids is an old and 
difficult problem.) 


9.22. The unlabeled degree sequence (often simply called degree se- 
quence) of a graph G is the vector deg(G) (defined in Exercise 9.19) rede- 
fined such that its entries are in decreasing order. How many distinct degree 
sequences are there for graphs on n nodes? (For known values for the first 


! An intimately related polytope, which is in fact older than the graphical zonotope associ- 
ated with G, is the acyclotope, the convex hull of net degree vectors of acyclic orientations 
of G. For example, the acyclotope of the complete graph on d nodes is the shifted permuta- 
hedron conv {(7(1), 7(2),..., 7(d)) : m € Sq}. It turns out that the acyclotope associated 
with G is the Minkowski sum of the line segments [e; — e,, e, — e;] for all edges jk of G. 
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few n, see [1, Sequence A004251]. The analogous question for labeled degree 
sequences was answered in [228]; that proof begins with Exercise 9.19.) 


9.23. Let K C R? be a d-dimensional convex body with the origin as its 
barycenter. If K contains only the origin as an interior lattice point, then 
vol(iK) < (arid where equality holds if and only if K is unimodularly 
equivalent to (d ++ 1)A, where A is the d-dimensional standard simplex from 
Section 2.3. (This is a conjecture of Ehrhart from 1964. Ehrhart himself 
proved this upper bound for all d-dimensional simplices, and also completely in 
dimension 2, but it remains open in general. The interested reader may consult 
[185] about the current state of Ehrhart’s conjecture and its relationship to 
the Ricci curvature of Fano manifolds.) 


Chapter 10 
h-Polynomials and h*-Polynomials 


Life is the twofold internal movement of composition and decomposition at once general 
and continuous. 


Henri de Blainville (1777-1850) 


In Chapters 2 and 3, we developed the Ehrhart polynomial and Ehrhart 
series of an integral polytope P and realized that the arithmetic information 
encoded in an Ehrhart polynomial is equivalent to the information encoded 
in its Ehrhart series. More precisely, when the Ehrhart series is written as a 
rational function, we introduced the name h*-polynomial for its numerator: 


Ehrp(z) = 1+ 5° Lp(t) = eer 


t>1 


Our goal in this chapter is to prove several decomposition formulas for h%(z) 
based on triangulations of P. As we will see, these decompositions will involve 
both arithmetic data from the simplices of the triangulation and combinatorial 
data from the face structure of the triangulation. 


10.1 Simplicial Polytopes and (Unimodular) 
Triangulations 


In Chapter 5—more precisely, in Theorem 5.1—we derived relations among 
the face numbers f;, of a simple polytope. The twin sisters of these Dehn— 
Sommerville relations for simplicial polytopes—all of whose nontrivial faces 
are simplices—were the subject of Exercise 5.9. It turns out they can be stated 
in a compact way by introducing the h-polynomial of the d-polytope P: 
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ga 
hei >) fe d=, 


k=-1 


where we set f_; := 1. The following is a nifty restatement of Exercise 5.9, as 
the reader should check (Exercise 10.1). 


Theorem 10.1 (Dehn—Sommerville relations). Jf P is a simplicial d- 
polytope, then hp is a palindromic polynomial. 


We need this simplicial version of the Dehn—Sommerville relations because 
it naturally connects to triangulations. The h-polynomial of P encodes combi- 
natorial information about the faces in the boundary of P, and so the natural 
setting in the world of triangulations will consist of a triangulation of the 
boundary of a polytope. By analogy with the face numbers of a polytope, 
given a triangulation T’ of the boundary OP of a given d-dimensional polytope 
P, we define f;, to be the number of k-simplices in T; they will be encoded, 
as above, in the h-polynomial 


d-1 
hr(z) := ia (1— a : 


k=-1 


where again we set f_1 := 1. The typical scenario we will encounter is that we 
are given a triangulation T' of the polytope P and then consider the induced 
triangulation 

{AeET: AC OP} 


of the boundary of P. 


Theorem 10.2 (Dehn—Sommerville relations for boundary triangu- 
lations). Given a regular triangulation of the polytope P, the h-polynomial 
of the induced triangulation of OP is palindromic. 


Proof. Given a regular triangulation T of P C R%, let Q C R**! be the 
corresponding lifted polytope as in (3.1). Choose a point v € P° and lift it 
to (v,h+2) € R41, where h is the maximal height among the vertices of Q. 
Let R be the convex hull of (v, +2) and the vertices of the lower hull of Q, 
and let 

S:= RN {x ER: aay, =h+1}. 


Figure 10.1 shows an instance of this setup. (The polytope S is a vertex 
figure of Q at v.) Exercise 10.2 says that S is a simplicial polytope whose 
face numbers f; equal the face numbers of the triangulation of OP induced 
by T. Thus the h-polynomial of this triangulation of OP equals hs(z), which 
is palindromic by Theorem 10.1. 


When we are given a triangulation T of a polytope (rather than of its 
boundary), we adjust our definition of the accompanying h-polynomial to 
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Fig. 10.1 The geometry of our proof of Theorem 10.2. 


d 
hp(z) = S- {2 aay, 


k=-1 


In general, there is no analogue of Theorem 10.2 for this h-polynomial, though 
an (important) exception is given in Exercise 10.3. 

There is a second reason for us to introduce the h-polynomial of a triangu- 
lation. We call a triangulation T unimodular if each simplex 


A = conv{vo,V1,---, Vk} €T 


has the property that the vectors v1 — vo, V2 — Vo, ---; Vk — Vo form a lattice 
basis of span(A) 9 Z4. In this case, we also call the simplex A unimodu- 
lar. One example of a unimodular simplex is the standard k-simplex A of 
Section 2.3. We recall that in this case, 
Ehra (z) : (10.1) 

ra(z) = —————— : 

AE = 2 
and we invite the reader to prove that the same Ehrhart series comes with 
every unimodular k-simplex (Exercise 10.4). 

Here is the main result of this section. 


Theorem 10.3. If P is an integral d-polytope that admits a unimodular 
triangulation T, then 
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hr(z) 
(GQ — 2a 


In words, the h*-polynomial of P is given by the h-polynomial of the triangu- 
lation T. 


Ehrp(z) = 


° 7 O---Q 

A vas 

41 7, .* 

/ I I \ 
=o oU Ua , a 1 0 
,o1, 1 7 

‘ua 1! 

/ 
co) fo) o---6 


Fig. 10.2 A triangulation of a hexagon and the corresponding decomposition (10.2). 


Proof. We begin by writing P as the union of all open simplices in T’, pictured 
in an example in Figure 10.2: 


P= | |x (10.2) 
AcT 


(here we are using Exercise 5.3). Since this union is disjoint, we can compute 


e dim A+1 
Ehrp(z)=1+ 5° Ehrae(z)=1+ >> ( ) , 


1l-z 
AcT AcT 


Here the last equality follows from the Ehrhart—Macdonald reciprocity theorem 
(Theorem 4.4) applied to the Ehrhart series (10.1) of a unimodular simplex. 
Since the above summands depend only on the dimension of each simplex, we 
can rewrite 


d _ k+1 d e k4+1 
Bhre(2) =1+ fe (T=) => a(S) 
k=0 


k=-1 


_ aha to) 


G— 21 G—2e1° 


Theorem 10.3 says something remarkable: the arithmetic of a polytope 
(its discrete volume) is completely determined by the combinatorics of the 
unimodular triangulation (its face structure). Unfortunately, not all integral 
polytopes admit unimodular triangulations—in fact, most do not (see Exer- 
cise 10.5). Our next goal is to find an analogue of Theorem 10.3 that holds 
for every integral polytope. 
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10.2 Fundamental Parallelepipeds Open Up, with an 
h-Twist 


Our philosophy in constructing a generalization of Theorem 10.3 rests in 
revisiting the cone over a given polytope from Chapter 3, using a decomposition 
of P into open simplices. 

Given an integral d-polytope P, fix a (not necessarily unimodular) trian- 
gulation T. As in (10.2), we can write P as the disjoint union of the open 
simplices in J’; from this point of view, the following definition should look 
natural. Given a simplex A € T, let II(A) be the fundamental parallelepiped 
of cone(A) (as defined in Section 3.3), and let 


Ba(z) := omaye(1,1,.--,1, 2). 


Thus Ba(z) is an “open” variant of hA(z) = onay(1,1,...,1,2), an equation 
we used several times in Chapters 3 and 4. 

Looking back at our proof of Theorem 10.3, it makes moral sense to include 
@ in the collection of faces of a triangulation of a given polytope, with 
the convention dim(@) := —1. (This explains in hindsight our convention 
f-1:= 1.) We will assume for the rest of this chapter that every triangulation 
T includes the “empty simplex” @. Along the same lines, we define Bg(z) := 1. 

We need one more concept to be able to state our generalization of Theo- 
rem 10.3. Given a simplex A € T, let 


link (A) = {Q ET: QNA=B, OC for some ® € T with AC 9H}, 
the link of A. In words, every simplex in linkr(A) is disjoint from A, yet it 


is the face of a simplex in T that also contains A as a face; see Figure 10.3 
for two examples. When the triangulation T is clear from the context, we 


link(1) 


Bd 


Fig. 10.3 The links of a vertex and an edge in a triangulation of the 3-cube. 


suppress the subscript and simply write link(A). We remark that by definition, 
link(@) = T. In general, link(A) consists of a collection of simplices in T, the 
largest dimension of which is d — dim(A) — 1 (Exercise 10.8), and so it is 
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reasonable to define 


d—dim(A)—1 


Aiink(A) (2) = > fe zktt (1 — 2) dim(A)-1-F 
k=—1 


where fj, denotes the number of k-simplices in link(A). The h-polynomial 
of link(A) encodes combinatorial data coming from the simplices in T that 
contain A. This statement can be made more precise, as we invite the reader 
to prove in Exercise 10.9: 


eee e dim(®)—dim(A) 
himecay(2) = (1 — 2)? >, (=) ; (10.3) 
bDA 


where the sum is over all simplices ® € T containing A. 


Theorem 10.4 (Betke-—McMullen decomposition of h*). Fix a trian- 
gulation T of the integer d-polytope P. Then 


acr Aiink(A) (2) Ba(z) 


Ehrp(z) = (1 — 2a" 


Before proving Theorem 10.4, we explain its relation to Theorem 10.3. 
If a simplex A € T is unimodular, then the corresponding fundamental 
parallelepiped II(A) of cone(A) contains the origin as its sole integer lattice 
point, and thus Ba(z) = 0, unless A = @. Thus if the triangulation T is 
unimodular, then the sum giving the h*-polynomial in Theorem 10.4 collapses 
to hiink(a)(z) Ba(z) = hr(z), and so Theorem 10.3 follows as a special case 
of Theorem 10.4. 


Proof of Theorem 10.4. We begin, as in our proof of Theorem 10.3, by writing 
P as the disjoint union of all open nonempty simplices in T as in (10.2), and 
thus, using Ehrhart—Macdonald reciprocity (Theorem 4.4), 


ae 
Ehrp(z)=1+ S> Ehrae(z)=1+ S¢ (-1)%™ pyaimcay MACS) 
AeT\{Z} AeT\{2} ( _ =) 
zdim(A)+1 l—z d—dim(A) p* ne 
(—2a 


Here h’,(z) denotes the h*-polynomial of the simplex A. Now we use Exer- 
cise 10.10: 


OA) (Z )=1+ ‘> onaye ( 


QCA 
enaey 


where the sum is over all nonempty faces of A. This identity specializes, if we 
choose z = (1,1,...,1, 2), to 
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ha(2) = S> Bale), 


QCA 


where now the sum is over all faces of A, including @ (recall that Bg(z) = 1). 
Substituting this back into (10.4) yields 


Wachee td =e) Voca Ba) 


Ehrp(z) =14 (i— 241 
7 Sey gon A (I -_ go) AEA Bo(4) i 
= (i — 2) (10.5) 


Recall that the polynomial Ba(z) encodes data about the lattice points in 
the interior of the fundamental parallelepiped of cone(A). The symmetry of 
this parallelepiped gets translated into the palindromicity of the associated 
polynomial (Exercise 10.7): 


Bale go Ba GE): (10.6) 


This allows us to rewrite (10.5) further: 


hi,(z) = is Puce ae al _ gee) » Bo (<) 
z 


AcT QCA 
= > gre _ aaa a > yo dun) —1 Bo (2) 
AcT QACA 
= » 2 Fae macau | _ x)? eA) Bo (ze) 
QET ADQ 
dim(A) —dim(Q) 
d—dim(Q) z 
= 001-2) Bo(z) > = 
eT ADQ 


Theorem 10.4 follows now with (10.3). 


10.3 Palindromic Decompositions of h*-Polynomials 


Our next goal is to refine Theorem 10.4 in the case that T comes from a 
boundary triangulation of P, for which we can exploit Theorem 10.2, or rather 
its analogue for links, which is the subject of Exercise 10.12. To keep our 
exposition accessible, we will discuss only the case that P contains an interior 
lattice point; the identities in this section and the next have analogues without 
this condition, which we will address in the notes at the end of the chapter. 


Theorem 10.5. Suppose P is an integral d-polytope that contains an interior 
lattice point. Then there exist unique polynomials a(z) and b(z) such that 
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The identities for a(z) and b(z) say that a(z) and b(z) are palindromic 
polynomials; the degree of a(z) is necessarily d (because the constant coefficient 
of a(z) is hi (0) = 1), while the degree of b(z) is d—1 or less. In fact, b(z) 
can be the zero polynomial—this happens if and only if P is the translate of 
a reflexive polytope, by Theorem 4.6. 


Proof. After a harmless lattice translation, we may assume that 0 € P°. Fix 
a regular boundary triangulation T of P and let 


To := TU {conv(A,0): Ae T}. 


Thus Jo is a triangulation of P whose simplices come in two flavors: those on 
the boundary of P and those of the form conv(A, 0) for some A € T. (We say 
that To comes from coning over the boundary triangulation T.) Figure 10.4 
shows an example. Perhaps not surprisingly, the two kinds of simplices in To 


iS 


Fig. 10.4 A triangulation To and its two types of simplices; the middle picture shows the 
boundary triangulation T. 


are related: Exercise 10.11 says that for every nonempty simplex A € T, 
Pinky, (A) (2) = Aiink-r, (conv(A,0)) (z) = Iiink (A) (z) : (10.7) 
Thus Theorem 10.4 gives in this case 


h(z) = > Minky, (a) (2) Ba(2) 


AETo 


a Aiinkr (A) (2) (Ba(z) + Beonv(A,0) (2)) : 
AcT 


I 


Now let 
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a(z) = » Pinks (A) (Zz) Ba(z) (10.8) 
AcT 
b(2) = = J Pinker(ay(2) Beonv(,0)(2)- (10.9) 


AcT 


The fact that each Boonv(A,o)(z) has constant term zero ensures that b(z) 
is a polynomial (of degree at most d— 1). Furthermore, by Exercise 10.12 
and (10.6), 


1 : 1 F 1 
d ae he d—dim(A)—-1). aa dim(A)+1 faa 
so) = Zeon) 
AceT 
= > Aiink (A)(Z) Ba(z) = a(z), 


AcT 


and b(z) = 24-1 b(4) follows analogously. Thus our setup gives rise to the 
decomposition 
his(z) = a(z) + 20(2) 


with palindromic polynomials a(z) and b(z), and Exercise 10.13 says that 
this decomposition is unique. (Note that this means, in particular, that every 
boundary triangulation yields the same decomposition.) 


10.4 Inequalities for h*-Polynomials 


The proof of Theorem 10.5 has a powerful consequence due to the following fact, 
whose proof would lead us too far astray (though we outline a self-contained 
proof in Exercises 10.16—10.18). 


Theorem 10.6. The h-polynomial of a link in a regular triangulation has 
nonnegative coefficients. 


Looking back how the polynomials a(z) and 6(z) were constructed 
(see (10.8) and (10.9)), we can thus deduce the following corollary: 


Corollary 10.7. The polynomials a(z) and b(z) appearing in Theorem 10.5 
have nonnegative coefficients. 


Naturally, the fact that the coefficients of a(z) and b(z) have nonnegative 
coefficients can be translated into inequalities among the coefficients of the 
accompanying h*-polynomials: the following corollary of Corollary 10.7 is 
easily proved (Exercise 10.14). 


Corollary 10.8. Suppose P is an integral d-polytope that contains an interior 
lattice point. Then its h*-polynomial h(x) = hia’ + he_yat 1 +--+ +h§ 
satisfies 
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Ag+ +.--+he,, > hath yt--+he_; for 0<j <([$]-1 (10.10) 
and 


Ao thptes +h; Shgphy ate +hy for OS7sd. (10.01) 


Notes 


1. Theorem 10.3 is only one indication of how important (and special) unimod- 
ular triangulations are; see [98, Chapter 9] for more. If an integral polytope 
P admits a unimodular triangulation, then it is integrally closed: for every 
positive integer k and every integer point x € kP, there exist integer points 
Y1,Y2,---,¥~ € P such that y; + yo +---+y~ = x.! (See Exercise 10.6.) 
There are integrally closed polytopes that do not admit a unimodular trian- 
gulation. These two notions are part of an interesting hierarchy of integral 
polytopes, with connections to commutative algebra and algebraic geometry; 
see, e.g., [73]. 


2. Theorem 10.4 is due to Ulrich Betke and Peter McMullen, as is Theo- 
rem 10.5, published in an influential paper [54] in 1985. Corollary 10.7 should 
perhaps have been in [54] (the nonnegativity of the h-polynomials in Betke— 
McMullen’s formulas was established in the 1970s), but it appeared only—in 
the guise of Corollary 10.8—in the 1990s, in papers by Takayuki Hibi [133] and 
Richard Stanley [227]. Neither paper gives the impression that the inequalities 
of Corollary 10.8 follow immediately from Betke-McMullen’s work, though 
both hold without the condition of the existence of an interior lattice point 
(here d has to be replaced by the degree of h%(z) in (10.11)) and in situations 
more general than the realm of Ehrhart series. 


3. Theorem 10.5 and Corollary 10.7 are not the end of the story. Building on 
work of Sam Payne [188] (which gave a multivariate version of Theorem 10.4), 
Alan Stapledon [234] generalized Theorem 10.5 and Corollary 10.7 to arbitrary 
integral polytopes. His theorem says that if hg(z) has degree s (recall from 
Chapter 4 that in this case, we say P has degree s), then there exist unique 
polynomials a(z) and b(z) with nonnegative coefficients such that 


(Ltzt---+24%) hp(z) =a(z) +2718 d(z), (10.12) 


a(z) = 24a(4), b(z) = 24 0(4), and, writing a(z) = agz4+ag_12z¢-'+---+a9, 


z 


1 Integrally closed polytopes are also said to have the integer decomposition prop- 
erty. There is a related notion for integral polytopes, namely that of normality. For full- 
dimensional polytopes, the terms integrally closed and normal are equivalent, but this is 
not the case when the subgroup Dex, yePnzd Z(x — y) of Z@ is not a direct summand of 


Z4. For more about this subtlety (and much more), see [73,88]. 


Exercises 193 
l=aj<a,<a; for 2<j<d-1. (10.13) 


These inequalities imply Corollary 10.8 without the condition that P contain 
an interior lattice point (again here d has to be replaced by s in (10.11)); see 
Exercise 10.15. Stapledon has recently improved this theorem further, giving 
infinitely many classes of linear inequalities among the h*-coefficients [232]. 
This exciting new line of research involves additional techniques from additive 
number theory. He also introduced a weighted variant of the h*-polynomial that 
is always palindromic, motivated by motivic integration and the cohomology 
of certain toric varieties [233]. One can easily recover hy (x) from this weighted 
h*-polynomial, but one can also deduce the palindromicity of both a(a) and 
b(x) as coming from the same source (and this perspective has some serious 
geometric applications). 


Exercises 


10.1. & Show that the f-vector and the h-vector of a simplicial d-polytope 
are related via 


k Has k d-j 
hp = So(-)3 es ) fj-1 and tet = ys 6 ~?) hj, 


j=0 j=0 


and conclude that the Dehn—Sommerville relations in Exercise 5.9 are equiva- 
lent to Theorem 10.1. 


10.2. & As in the setup of our proof of Theorem 10.2, let Q C R**! be the 
lifted polytope giving rise to a regular triangulation T of the polytope P C R¢, 
choose v € P®, and lift it to (v,h +2) € R@, where h is the maximal height 
among the vertices of Q. Let R be the convex hull of (v, +2) and the vertices 
of the lower hull of Q, and let 


S:=RN{xeR*: cay, =h+1}. 


Prove that S is a simplicial polytope whose face numbers f; equal the face 
numbers of the triangulation of OP induced by T. 


10.3. Given a triangulation T of the boundary of a d-polytope P and a point 
v € P°, construct a triangulation K of P consisting of T with the simplices 
conv(A,v) for all A € T appended; i.e., the new triangulation K comes from 
coning over T’. Prove that 


10.4. & Show that if A is a unimodular k-simplex, then 
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1 


Ehra(z) = Goat’ 


10.5. Show that every integral polygon admits a unimodular triangulation. 
Give an example of an integral d-polytope that does not admit a unimodular 
triangulation for any d > 3. 


10.6. Recall from the notes that an integral polytope P is integrally closed 
if for every positive integer k and every integer point x € kP, there exist 
integer points y1, Y2,---, yx € P such that y; +yo+---+y,z =x. Prove that 
every integral polytope that admits a unimodular triangulation is integrally 
closed. 


10.7. & Prove (10.6): Ba(z) = ztm4)+1B, (4). 


10.8. & Let T be a triangulation of the d-polytope P. Given A € T, show 
that the largest dimension occurring among the simplices in link(A) is d— 
dim(A) — 1. 


10.9. & Prove (10.3): given a triangulation T of a d-dimensional polytope, 
then for every simplex A € T, 


e dim(®)—dim(A) 
Irine(ay(2) = (1 — z)48) SO (; = :) 
bDA 


where the sum is over all simplices ® € T that contain A. 


10.10. & For an integral simplex A, let II(A) denote the fundamental paral- 
lelepiped of cone(A). Show that 


ona)(z) = 1+ S- ona) (2); 
ACA 


where the sum is over all nonempty faces of A. 


10.11. & Prove (10.7): Fix a regular triangulation To of a polytope P with 0 € 
P° whose vertices are 0 and the vertices of P, and let T := {A € Tp: A C OP}. 
Then for every nonempty A € T, 


Miink(A) (z) = hiink(conv(A,0)) (2) . 


10.12. & Let A be a simplex in a regular boundary triangulation of a d- 
dimensional polytope. Prove that hyink(a)(z) is palindromic. (Hint: establish 
a one-to-one correspondence between link(A) and the boundary faces of a 
polytope of dimension d — dim(A) — 1, respecting the face relations. ) 


10.13. & Let p(z) be a polynomial of degree d. Show that there are unique 
polynomials a(z) and b(z) such that 
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p(z) = a(z) + 2b(z), 

a(z) = 24 a(4), and 6(z) = z4-' 0(4). 

10.14. & Consider the polynomial decomposition hZ(z) = a(z) + zb(z) of 

an integral d-polytope that contains an interior lattice point, given in The- 


orem 10.5. Prove that the fact that a(z) has nonnegative coefficients im- 
plies (10.10): 


Agthit---+hi,, > hg+he_yt---+h3_; for O<j7 < [8] -1, 
and the fact that 6(z) has nonnegative coefficients implies (10.11): 
ho thy te + hy Sho thy toothy for O<7 <a. 


10.15. Derive the analogue of Corollary 10.8 without the condition that P 
contain an interior lattice point, using (10.12) and (10.13). 


10.16. A shelling of the d-polyhedron P is a linear ordering of the facets 
F,, F2,...,Fm of P such that the following recursive conditions hold: 


(1) F; has a shelling. 
(2) For every j > 1, (Fy U Fg U--- UF3_1) NF; is of dimension d— 2 and 


(Fi U Fa UU Fy_1) NF = Gr UGo Us UG, 


where Gj UG2 U--- UG, U-++ UG», is a shelling of F;.? 


Q 


g 


AAaRAd 


g 


Fig. 10.5 Constructing a line shelling of a pentagon. 


Given a d-polytope P C R%, choose a generic vector v € R@ (e.g., you can 
choose a vector at random). If we think of P as a planet and we place a hot-air 
balloon on one of the facets of P, we will slowly see more and more facets, 
one at a time, as the balloon rises in the direction of v (see Figure 10.5). Here 


2 Note that this condition implies that (F, U Fg U---U F;~1) NF; is connected for d > 3. 
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seeing means the following: we say that a facet F is visible from the point 
x ¢ P if the line segment between x and any point y € F intersects P only 
in y. 

Our balloon ride gives rise (no pun intended) to an ordering Fi, F2,..., Fm 
of the facets of P. Namely, 7; is the facet on which we are beginning our 
journey, F2 is the next visible facet, etc., until we are high enough so that 
no more visible facets can be added. At this point, we “pass through infinity” 
and let the balloon approach the polytope planet from the opposite side. The 
next facet in our list is the first one that will disappear as we move toward the 
polytope, then the next facet that will disappear, etc., until we are landing 
back on the polytope. (That is, the second half of our list of facets is the 
reversed list of what we would get had we started our ride at Fi.) 

Prove that this ordering F,,F2,...,Fm of the facets of P is a shelling 
(called a line shelling). 


10.17. Suppose P is a simplicial d-polytope with facets F,, F2,...,Fm, or- 
dered by some line shelling. We collect the vertices of F; in the set Fj. Let 
F, be the set of all vertices v € Fj such that F; \ {v} is contained in one of 
F, Fo,..., Fj;—1 (which define the facets coming earlier in the shelling order). 


(a) Prove that the new faces appearing in the j*" step of the construction of 


the line shelling, i.e., the faces in F; \ (#1 U F2 U---U F;_1), are precisely 
those sets conv(G) for some F; C G C Fj. 
(b) Show that 


where hm denotes the number of sets F ; of cardinality m. 
(c) Conclude that h,, = h,, and thus that h,, > 0. 


10.18. Given a regular triangulation of the polytope P, prove that the h- 
polynomial of the induced triangulation of OP has nonnegative coefficients, 
as does the h-polynomial of the link of every simplex in this triangulation. 


10.19. Show that the construction in Exercise 10.17 gives the following char- 
acterization of the h-polynomial of a simple polytope P: fix a generic direction 
vector v and orient the graph of P so that each oriented edge (viewed as a 
vector) forms an acute angle with v. Then h;, equals the number of vertices 
of this oriented graph with in-degree k. 


Open Problems 


10.20. Which integral polytopes admit a unimodular triangulation? (The 
property of admitting unimodular triangulations is part of an interesting 
hierarchy of integral polytopes; see [73].) 
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10.21. Find a complete set of inequalities for the coefficients of all h*- 
polynomials of degree < 3, i.e., piecewise linear regions in R® all of whose 
integer points (h},h5,h3) come from an h*-polynomial of a 3-dimensional 
integral polytope. (See also Open Problem 3.43.) 


10.22. A linear inequality ap%) +a,%, +...ag%q > 0 is balanced if ag +a, + 
--+-+ aq = 0. Find a complete set of balanced inequalities for the coefficients 
of every h*-polynomial of degree < 6. (For degree < 5, see [232].) 


10.23. Prove that every integrally closed reflexive polytope has a unimodal 
h*-polynomial. More generally, prove that every integrally closed polytope has 
a unimodal h*-polynomial. Even more generally, classify integral polytopes 
with a unimodal h*-polynomial. (See [12, 63, 74,210] for possible starting 
points.) 


Chapter 11 


The Decomposition of a Polytope into 
Its Cones 


Mathematics compares the most diverse phenomena and discovers the secret analogies that 
unite them. 


Jean Baptiste Joseph Fourier (1768-1830) 


In this chapter, we return to integer-point transforms of rational cones and 
polytopes and connect them in a magical way that was first discovered by 
Michel Brion. The power of Brion’s theorem has been applied to various 
domains, such as Barvinok’s algorithm in integer linear programming, and to 
higher-dimensional Fuler—Maclaurin summation formulas, which we study in 


Chapter 12. In a sense, Brion’s theorem is the natural extension of the familiar 


; ; a : b b+1_ ya : : F 
finite geometric series identity )7,,_, 2” = —* to higher dimensions. 


11.1 The Identity “>? <2 2” = 0”... 
Or “Much Ado About Nothing” 


We begin gently by illustrating Brion’s theorem in dimension 1. To this end, 
let’s consider the line segment Z := [20,34]. We recall that its integer-point 
transform lists the lattice points in Z in the form of monomials: 


or(z) = , gm — 720 4 gl 4 784 
meEINZ 


Already in this simple example, we are too lazy to list all integers in Z and 
use --- to write the polynomial oz. Is there a more compact way to write 
oz? The reader might have guessed it even before we asked the question: this 
integer-point transform equals the rational function 
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gi 938 


EK) 1l-—z 


This last sentence is not quite correct: the definition of oz(z) yielded a 
polynomial in z, whereas the rational function above is not defined at z = 1. 
We can overcome this deficiency by noticing that the limit of this rational 
function as z + 1 equals the evaluation of the polynomials oz(1) = 15, by 
L’Hopital’s rule. Notice that the rational-function representation of oz has 
the unquestionable advantage of being much more compact than the original 
polynomial representation. The reader who is not convinced of this advantage 
should replace the right vertex 34 of Z by 3400. 

Now let’s rewrite the rational form of the integer-point transform of Z 
slightly: 


720 _ 35 720 234 


tae Gag at (11.1) 


or(z) = 


There is a natural geometric interpretation of the two summands on the 
right-hand side. The first term represents the integer-point transform of the 
interval [20, 00): 


opom(2)= Yo = 


m>20 


The second term in (11.1) corresponds to the integer-point transform of the 
interval (—oo, 34]: 


234 
F (00,34) (2) = os zg = f=" 
m<34 z 
So (11.1) says that on a rational-function level, 
020,00) (2) + O(—00,34] (2) = F [20,34] (2) - (11.2) 
——————_® @ 
20 34 
® o—__—_——_ 
20 34 
—__—_———_-# ee ——™_ _ —_—_—_- 
20 34 


Fig. 11.1 Decomposing a line segment into two infinite rays. 


This identity, which we illustrate graphically in Figure 11.1, should come 
as a mild surprise. Two rational functions that represent infinite sequences 
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somehow collapse, when being summed up, to a polynomial with a finite 
number of terms. We emphasize that (11.2) does not make sense on the level 
of infinite series; in fact, the two infinite series involved here have disjoint 
regions of convergence. 

Even more magical is the geometry behind this identity: on the right-hand 
side, we have a polynomial that lists the integer points in a finite interval P, 
while on the left-hand side, each of the rational generating functions represents 
the integer points in an infinite ray that begins at a vertex of P. The two 
half-lines will be called verter cones below, and indeed, the remainder of 
this chapter is devoted to proving that an identity similar to (11.2) holds in 
general dimension. 

We recall a definition that was touched on only briefly in Exercises 3.19 
and 9.8: A hyperplane arrangement H is a finite collection of hyperplanes. 
An arrangement H is rational if all its hyperplanes are rational, that is, if each 
hyperplane in H is of the form {x ER: aya, + aovg +--+ + Gata = b} for 
some @1,42,-..,@¢q,0 € Z. An arrangement H is called a central hyperplane 
arrangement if its hyperplanes meet in (at least) one point. 

Our next definition generalizes (finally) the notion of a pointed cone, defined 
in Chapter 3. A cone is the intersection of finitely many half-spaces of the 
form 

{xe R?: a1%1 + AQ%Q +++: + Adqla < b} 


for which the corresponding hyperplanes 
{xe R?: a421 + ag%Q +---+ agra = b} 


form a central arrangement. This definition extends that of a pointed cone: a 
cone is pointed if the defining hyperplanes meet in exactly one point. Thus 
nonpointed cones are cones that contain a line. A cone is rational if all of 
its defining hyperplanes are rational. Cones and polytopes are special cases 
of polyhedra, which are convex bodies defined as the intersection of finitely 
many half-spaces. Our next goal is to prove that every rational cone that 
contains a line essentially has integer-point transform equal to zero, in a sense 
that we have to make mathematically precise. 


11.2 Technically Speaking... 


To understand integer-point transforms of general cones, we need a small 
aspect of the theory of modules, the technically necessary tools to make things 
rigorous. To see what kind of problems might arise, consider the situation 
of Alice who decides to glue together a few pointed cones A; to form some 
(not necessarily convex) polyhedral region K, and her friend Bob decides 
to glue together some other pointed cones B;, to form the same region K. 
Why should it be true that the sum of the rational functions that we have 
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associated with the pointed cones A; equals the sum of the rational functions 
that we have associated with the pointed cones B;? We must account for this 
inherent potential ambiguity and show that it is in fact unambiguous—Alice 
and Bob will always get the same rational functions. 

We recall that a module M over a ring R is a generalization of a vector 
space over a field, where the salient new feature is that the field of scalars is 
replaced by a ring of scalars. Consider, for example, the set of all polynomials 
in d variables with coefficients in Z, under addition. We can think intuitively 
of this particular module over the ring Z as a discretized version of a vector 
space over the field of real numbers. 

The only module notions that are important to us in this chapter are that 
M forms an Abelian group under addition, that we can multiply elements of R 
by elements of M in a sensible way (e.g., compatible with ring multiplication 
in R), and that we preserve the definition of a linear map ¢ between two 
modules M and N over the same ring R, namely, ¢ satisfies 


o(m, +7 mz) = o(m1) +7 d(m2) 


for every m,,m2 € M andre R. 

From the point of departure for this chapter, it might not come as a surprise 
that the two modules we will use are formed by Laurent series and by rational 
functions. We cannot multiply two Laurent series in a meaningful way (and 
a reader who has not thought about this should construct an example in 
which things go wrong), but we can multiply a Laurent series by a Laurent 
polynomial, i.e., a finite Laurent series. This motivates the choice of ring with 
which we will be operating, namely, the ring 


C [2*] C2; eee ses | 


of Laurent polynomials in d variables with coefficients in C. We will con- 
sider two modules over C [z*]: on the one hand, the set Cla of all (for- 
mal) Laurent series of rational cones in R¢, and on the other hand, the set 
C(z) := C(x, z2,..-, 2a) of rational functions in d variables. 

In Chapter 3, we saw that certain Laurent series, namely integer-point 
transforms of rational simplicial cones, naturally evaluate to rational functions, 
by a natural tiling argument. The following lemma says that we can think of 
this evaluation as one instance of a linear map between the modules Clg and 
C(z) over C [z*]. 


Lemma 11.1. There is a unique linear map ¢: Cla > C(z) that maps an 
integer-point transform (viewed as a Laurent series) of a rational simplicial 
cone 


{v + Arwi + Agwe +++ +ARWR: A1,A2,---,An > OF} CR? 


to the rational function 
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on(Z) 
(1—2™1) (1 — gW2).-.(1 — gwr) ) 


where oy (Zz) ts the integer-point transform of the half-open parallelepiped 
II := {v + Apw, + Agwe + +++ +ALWe :0< Ay, A2;-++,Ak < 1}. (11.3) 


To understand the significance of this lemma, we note that CLg is generated 
by the integer-point transforms of rational simplicial cones, since every rational 
cone can be triangulated by rational simplicial cones. Thus Lemma 11.1 says 
that the rational forms of integer-point generating functions of simplicial 
cones, which we derived in Chapter 3, extend uniquely to rational forms of 
integer-point generating functions of all rational cones. 


Proof of Lemma 11.1. Let 
K:= {v + Apwi + Aowe +--+ + ARWe 3 Ai, A2,-+-,Ak = OF 


be a simplicial rational k-cone in R¢. For clarity, let’s consider ox¢(z) as an 
element in CLg, and for this Laurent series we proved in Chapter 3 the identity 


(1 — 2) (1—2™)--- (1 —2**) ox(z) = on(z), (11.4) 


where II is given by (11.3). We remark that (11.4) is an identity in the module 
CLa over C [z*]: the factor (1 — z¥!) (1 — 22) ---(1 — z*) on the left-hand 
side and the right-hand side oy(z) are Laurent polynomials in C [z*], whereas 
ox(z) is in CLg. 

If K is now a general rational cone, we can triangulate it into simplicial cone, 
each of which comes with a version of (11.4). The integer-point transform 
ox«(z) € Cla of our general cone K can naturally be written in an inclusion— 
exclusion form as a sum (with positive and negative terms) of integer-point 
transforms of these simplicial cones and their faces, which are also simplicial 
cones. Applying the same sum to the identities of the form (11.4) for these 
simplicial cones gives an identity 


g(Z) on (2) = f(z) 


for some Laurent monomials f(z) and g(z). This yields our sought-after linear 
map: we define 


DRORAN ay = 


That this map ¢ is linear follows by construction, and that it is unique 
follows from the uniqueness of the rational-function form of ox¢(z) when K is 
simplicial. 
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The linear map ¢ in Lemma 11.1 allows us to make precise our aforemen- 
tioned statement that integer-point transforms of rational cones that contain 
a line are philosophically equal to zero. 


Lemma 11.2. Let ¢: Cla > C(z) be the linear map in Lemma 11.1, and let 
K CR? be a rational cone that contains a line. Then 


$(oK(z)) =0. 


Proof. Let K CR? be a rational cone that contains a line. This implies that 
there exists a vector w € Z? \ {0} such that w +K =X. Translated into the 
language of Laurent series, this means that z2Y ox (z) = ox(z), and thus, since 
@ is linear, 


Z” o(ox(Z)) = ¢(oK(z))- 


But this gives the identity (1 — z”) ¢ (ox (z)) = 0 in the world C(z) of rational 
functions. Since 1 — z™ is not a zero divisor in this world, we conclude that 


b (ox (z)) = 0. 


11.3 Tangent Cones and Their Rational Generating 
Functions 


The goal of this section, apart from developing the language that allows us 
to prove Brion’s theorem, is to prove a sort of analogue of (11.2) in general 
dimension. 

We now attach a cone to each face F of P, namely its tangent cone, 
defined by 


Kp :={x+A(y—x): x€F, ye P,X€Rso}. 


We note that Kp = span. For a vertex v of P, the tangent cone K,, is often 
called a vertex cone; it is pointed, and we show an example in Figure 11.2. 
For a k-face F of P with k > 0, the tangent cone K is not pointed. For 
example, the tangent cone of an edge of a 3-polytope is a wedge. 


Lemma 11.3. For every face F of P, we have spanF C Ke. 


Proof. As x and y vary over all points of F, the points x + \(y — x) vary 
over span F. 


We note that this lemma implies that K¢ contains a line unless F is a 
vertex. More precisely, if K+ is not pointed, it contains the affine space span F, 
which is called the apex of the tangent cone. (A pointed cone has a point as 
apex. We also remark that Kz is the smallest convex cone with apex span F 
that contains P.) Consequently, Lemma 11.2 implies the following: 
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Fig. 11.2 A vertex cone. 


Corollary 11.4. Let 6: CLa — C(z) be the linear map in Lemma 11.1, and 
let F be a face of P that is not a vertex. Then 


¢ (0K, (2)) =0. 


11.4 Brion’s Theorem 


The following theorem is a classical identity of convex geometry named after 
Charles Julien Brianchon (1783-1864)! and Jorgen Pedersen Gram (1850- 
1916).? It holds for every convex polytope. However, its proof for simplices 
is considerably simpler than that for the general case. We need only the 
Brianchon—Gram identity for simplices, so we restrict ourselves to this special 
case. (One could prove the general case along similar lines as below; however, 
we would need some additional machinery not covered in this book.) The 
indicator function 1g of a set S C R® is defined by 


1 if xeES, 
1s(x) ={4 if x¢ 8. 


Theorem 11.5 (Brianchon—Gram identity for simplices). Let A be a 
d-simplex. Then 


La(x) = SO (-D)%™F 1x (x); 


FCA 


where the sum is taken over all nonempty faces F of A. 


Proof. We distinguish between two disjoint cases: whether or not x is in the 
simplex. 


1 For more information about Brianchon, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Brianchon. html. 


? For more information about Gram, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Gram. html. 
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Case 1: x € A. Then x € K for each F of A, and the identity becomes 


dim A 
ies Cie 
FEL, k=0 


This is the Euler relation for simplices, proved in Exercise 5.6. 


Case 2: x ¢ A. Then there is a unique minimal face F of A (minimal with 
respect to dimension) such that x € Kz and x € Kg for all faces G C A that 
contain F (Exercise 11.4). The identity to be proved is now 


j= Ss. (11.5) 


GQF 


The validity of this identity again follows from the logic of Exercise 5.6; the 
proof of (11.5) is the subject of Exercise 11.6. 


Corollary 11.6 (Brion’s theorem for simplices). Suppose A is a ratio- 
nal simplex. Then we have the following identity of rational functions: 


ox(z)= So, (2). 


v a vertex of A 


Proof. We translate the Brianchon—Gram theorem into the language of inte- 
ger-point transforms: we sum both sides of the identity in Theorem 11.5 for 
all m € Z4, 


Sots = Ya ie 


meéZ? meZt FCA 


which is equivalent to 


oa(z) = So (-1)9™7 ox, (2): 


FCA 


Now we apply the linear map ¢ : CLag > C(z) from Lemma 11.1 to this 
identity: 


$(oa(z)) = Do (-1)™* 6 (oxz(z)). 


FCA 


The left-hand side is the Laurent polynomial o,(z). On the right-hand side, 
we can see the rational-function versions (which we may still denote by 
ox,,(z)) of the integer-point transforms of the (simplicial) vertex cones Ky, 
and the remaining terms, namely ¢(ox,-(z)) for nonvertices F, disappear by 
Corollary 11.4. 


Now we extend Corollary 11.6 to every convex rational polytope: 
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Theorem 11.7 (Brion’s theorem). Suppose P is a rational convex poly- 
tope. Then we have the following identity of rational functions: 


op(z)= Sax, (2). (11.6) 


v a vertex of P 


Proof. We use the same irrational trick as in the proofs of Theorems 3.12 
and 4.3. Namely, we begin by triangulating P into the simplices A,, Ao,..., Am 
(using no new vertices). Consider the hyperplane arrangement 


H :={spanF : F is a facet of Ay, Ao,..., or Am}. 


We will now shift the hyperplanes in , obtaining a new hyperplane arrange- 
ment H*. Those hyperplanes of H that defined P now define, after shifting, 
a new polytope that we will call P*"'. Exercise 11.8 ensures that we can shift 
H. in such a way that: 


e no hyperplane in 1° contains a lattice point; 

e Het yields a triangulation of Ps; 

e the lattice points contained in a vertex cone of P are precisely the lattice 
points contained in the corresponding vertex cone of Ps, 


This setup implies that 


e the lattice points in P are precisely the lattice points in Ps; 

e the lattice points in a vertex cone of P*™* can be written as a disjoint 
union of lattice points in vertex cones of simplices of the triangulation that 
Het induces on P**, 


The latter two conditions, in turn, mean that Brion’s identity (11.6) follows 
from Brion’s theorem for simplices: the integer-point transforms on both sides 
of the identity can be written as a sum of integer-point transforms of simplices 
and their vertex cones. 


11.5 Brion Implies Ehrhart 


We conclude this chapter by showing that Ehrhart’s theorem (Theorem 3.23) 
for rational polytopes (which includes the integral case, Theorem 3.8) follows 
from Brion’s theorem (Theorem 11.7) in a relatively straightforward manner. 


Second Proof of Theorem 8.23. As in our first proof of Ehrhart’s theorem, 
it suffices to prove Theorem 3.23 for simplices, because we can triangulate 
any polytope (using only the vertices). So suppose A is a rational d-simplex 
whose vertices have coordinates with denominator p. Our goal is to show that 
for a fixed 0 <r < p, the function L,a(r + pt) is a polynomial in ¢; this means 
that La is a quasipolynomial with period dividing p. 
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First, if r = 0, then La(pt) = Lpa(t), which is a polynomial by Ehrhart’s 
theorem (Theorem 3.8), because pA is an integer simplex. 
Now we assume r > 0. By Theorem 11.7, 


do 
mé€(r+pt)ANZ4 


= lim or +p a 2) (11.7) 
=> lim ye O(r+pt)Ky (z) ‘i 


Zl 
v vertex of A 


I 


La(r + pt) 


We used the limit computation for the integer-point transform o(,4p4)a rather 
than the evaluation 0(,+4,1)a(1), because that evaluation would have yielded 
singularities in the rational generating functions of the vertex cones. Note 
that the vertex cones Ky are all simplicial, because A is a simplex. So suppose 


Ky = {v + Arwi t+ Aqw2 +--+ + AGWa : A1,A2,---,Aa = OF; 
then 


(r+ pt)Ky = {(r + pt)v + Arwy + Agw2 +--+ + AqWa: Ai, A2,---,Aa = OF 
= tpv + {rv + Aw + Agwe +--+ + Aawa: Ai, A2,---,Aa = OF 
=tpv+rk,. 


What is important to note here is that pv is an integer vector. In particular, 
we can safely write 
O(rtpt)Ky (2) = 2?’ OK, (Z) 


(we say safely because tpv € Z%, so z'?Y is indeed a monomial). Now we can 
rewrite (11.7) as 


La(r+pt)=lim SY) 2% oy x, (2). (11.8) 
ee v vertex of A 
The exact form of the rational functions o;<,(z) is not important for the 
purposes of the proof, except for the fact that they do not depend on t. We 
know that the sum of the generating functions of all vertex cones is a Laurent 
polynomial in z; that is, the singularities of the rational functions cancel. To 
compute La(r + pt) from (11.8), we combine all of the rational functions on 
the right-hand side over one denominator and use L’Hopital’s rule to compute 
the limit of this one huge rational function. The variable t appears only in 
the simple monomials z’?Y, so the effect of L’Hépital’s rule is that we obtain 
linear factors of t every time we differentiate the numerator of this rational 
function. At the end, we evaluate the remaining rational function at z = 1. 
The result is a polynomial in t. 
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Notes 


1. Theorem 11.5 (in its general form for convex polytopes) has an interesting 
history. In 1837, Charles Brianchon proved a version of this theorem involving 
volumes of polytopes in R? [65]. In 1874, Jorgen Gram gave a proof of the 
same result [121]; apparently, he was unaware of Brianchon’s paper. In 1927, 
Duncan Sommerville published a proof for general d [223], which was corrected 
in the 1960s by Victor Klee [150], Branko Griinbaum [126, Section 14.1], and 
many others. 


2. Michel Brion discovered Theorem 11.7 in 1988 [66]. His proof involved the 
Baum-Fulton—Quart Riemann—Roch formula for equivariant K-theory of toric 
varieties. A more elementary proof of Theorem 11.7 was found by Masa-Nori 
Ishida a few years later [142]. Our approach in this chapter follows [40]. 


3. Around the time Theorem 11.7 was conceived, Jim Lawrence [161] and 
Alexander Varchenko [249] discovered a companion theorem—it also gives the 
integer-point transform of a rational polytope as a sum of rational functions 
stemming from cones based at the vertices of the polytopes, but slightly 
different from the vertex cones. Both the Brion and the Lawrence—Varchenko 
theorems can be neatly understood using the theory of valuations, which we 
do not use here (see, for example, [160]). 


4. As we have already remarked earlier, Brion’s theorem led to an efficient 
algorithm by Alexander Barvinok to compute Ehrhart quasipolynomials [24]. 
More precisely, Barvinok proved that in fixed dimension, one can efficiently? 
compute the Ehrhart series )>,..) Lp(t) 2’ as a short sum of rational functions.* 
Brion’s theorem essentially reduces the problem to computing the integer- 
point transforms of the rational tangent cones of the polytope. Barvinok’s 
ingenious idea was to use a signed decomposition of a rational cone to compute 
its integer-point transform: the cone is written as a sum and difference of 
unimodular cones, which we will encounter in Section 12.4 and which have 
a trivial integer-point transform. Finding a signed decomposition involves 
triangulations, Minkowski’s theorem on lattice points in convex bodies (see, 
for example, [82, 178, 186, 220]), and the LLL algorithm, which finds a short 
vector in a lattice [163]. At any rate, Barvinok proved that one can find a 
signed decomposition quickly, which is the main step toward computing the 
Ehrhart series of the polytope. Barvinok’s algorithm has been implemented 
in the software packages barvinok [250] and LattE [94,96, 154]. Barvinok’s 
algorithm is described in detail in [22]. 


3 Efficiently here means that for every dimension, there exists a polynomial that gives an 
upper bound on the running time of the algorithm when evaluated at the logarithm of the 
input data of the polytope (e.g., its vertices). 

4 Short means that the set of data needed to output this sum of rational functions is also 
of polynomial size in the logarithm of the input data of the polytope. 
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Exercises 


11.1. Prove that the cones in R@ are precisely the sets of the form 
vt{xeR?: Ax <0} 
for some v € R¢ and A € R™*¢, 


11.2. Recall that the orthogonal complement A+ of an affine space A is 
defined by 
At := {xe R*: x-v =0 for allve A}. 


Prove that for every face F of a polytope, (span F)> Ks is a pointed cone. 
(Hint: Show that if H is a defining hyperplane for F, then HM (span F)* is 
: L 

a hyperplane in the vector space (span F)~.) 

11.3. Let P C R?@ be a full-dimensional polytope. A face F of P is visible 
from the point x € R¢ if the line segment between x and any point y € F 
intersects P only in y. (We used this notion already in Exercise 10.16.) Show 
that F is visible from x if and only if x ¢ Kz. 


11.4. & Suppose A is a simplex and x ¢ A. Prove that there is a unique 
minimal face F C A (minimal with respect to dimension) such that the 
corresponding tangent cone K contains x. Show that x € Kg for all faces 
G C A that contain F, and x ¢ Kg for all other faces G. 


11.5. Show that Exercise 11.4 fails to be true if A is a quadrilateral (for 
example). Show that the Brianchon—Gram identity holds for your quadrilateral. 


11.6. & Prove (11.5): for a face F of a simplex A, 


a (i? —(0 
UoF 
where the sum is taken over all faces of A that contain F. 


11.7. Give a direct proof of Brion’s theorem for the 1-dimensional case. 


11.8. & Provide the details of the irrational-shift argument in the proof of 
Theorem 11.7: Given a rational polytope P, triangulate it into the simplices 
Aj, Ao,..., Am (using no new vertices). Consider the hyperplane arrangement 


H :={spanF : F is a facet of Ay, Ao,..., Am}. 


We will now shift the hyperplanes in H, obtaining a new hyperplane arrange- 
ment Ht. Those hyperplanes of H that defined P now define, after shifting, 
a new polytope that we will call Ps. Prove that we can shift H in such a 
way that: 
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e no hyperplane in 1°" contains any lattice point; 

e HM€ yields a triangulation of Ps, 

e the lattice points contained in a vertex cone of P are precisely the lattice 
points contained in the corresponding vertex cone of Ps", 


11.9. & Prove the following “open polytope” analogue for Brion’s theorem: If 
P is a rational convex polytope, then we have the identity of rational functions 


ope(z) = > OKs (Z)- 


v a vertex of P 


11.10. Prove the following extension of Ehrhart’s theorem (Theorem 3.23): 
Suppose P C R?¢ is a rational convex polytope and q is a polynomial in d 


variables. Then 
Os Salm) 
metPnZ4 


is a quasipolynomial in t. (Hint: Modify the proof in Section 11.5 by intro- 
ducing a differential operator.) 


Chapter 12 
Euler—Maclaurin Summation in R? 


All means (even continuous) sanctify the discrete end. 


Doron Zeilberger 


Thus far we have often been concerned with the difference between the discrete 
volume of a polytope P and its continuous volume. In other words, the quantity 


S- 1- fray. (12.1) 


mePnZ4 


which is by definition Lp(1) — vol(P), has been on our minds for a long time 
and has arisen naturally in many different contexts. An important extension is 
the difference between the discrete integer-point transform and its continuous 


sibling: 
a enn f eY *dy , (12.2) 


mePnZ4 P 


where we have replaced the variable z that we have commonly used in generat- 
ing functions by the exponential variable (21, z2,...,Za) = (e*, e”,...,e"). 
Note that on setting x = 0 in (12.2), we get the former quantity (12.1). Rela- 
tions between the two quantities > .epagae™™ and J, e”*dy are known as 
Euler—Maclaurin summation formulas for polytopes. The “behind-the-scenes” 
operators that are responsible for affording us with such connections are the 
differential operators known as Todd operators, whose definition utilizes the 
Bernoulli numbers in a surprising way. 
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12.1 Todd Operators and Bernoulli Numbers 


Recall the Bernoulli numbers B;, from Section 2.4, defined by the generating 
function 


We now introduce a differential operator via essentially the same generating 


function, namely 
Todd, := S7(-1)*2# (4 ; (12.3) 
— kl \dh) * 


This Todd operator is often abbreviated as 


d 
Todd, = ah 
l1-—e7— 


? 


=e 


but we should keep in mind that this is only a shorthand notation for the 
infinite series (12.3). We first show that the exponential function is an eigen- 
function of the Todd operator. 


Lemma 12.1. For z © C \ {0} with |z| < 27, 


Todd, e?" = 


Proof. 


Bef a\* 
Todd, e*" = }0(—1)* (=) ezh 


k! 
k>0 : 
B 
yeh k?Pk 
ip DG 
k>0 
a ezh Te 
e*—1 


The condition |z| < 27 is needed in the last step, by Exercise 2.14. 


The Todd operator is a discretizing operator, in the sense that it transforms 
a continuous integral into a discrete sum, as the following theorem shows. 


Theorem 12.2 (Euler—Maclaurin in dimension 1). For alla << be Z 
and z €C with |z| < 27, 
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b+hy, 
Todd;,, Todd,,, | edz 


a—h2 


Proof. Case 1: z = 0. Then e** = 1, and so 


b+hy, 
Todd), Toddp, Z edz 


a—hg 


b+hi 
= Toddp, Toddh, / dx 


a—hg 


hi=h2=0 

= b—a+ Todd, hy + Todd, hal, p,—0 
1 
2 


1 
=0b a hits thet 


hi=h2=0 
=b-a+l 


by Exercise 12.1. Since > eX — h —a+1, we have verified the theorem 
in this case. 


Case 2: z #0. Then 


b+hy 
Todd, Toddp, / 


a—h2 


1 
e**dx = Toddp, Toddp, e Ce _ a) 


l 


1 
= (Toddp, e*t*"1 — Todd), e7*~*"?) 
z 


e2 ere 
rs Toddn, ew Toddp, e772 
e% 2 ez e%4 x eo he 
~ z l-—e- z l-e ’ 
where the last step follows from Lemma 12.1. Hence 
b+hy 1 1 
Todd), Todd), / edz =e + e**% 
a—hie 1—e-* 1 — e? 
hi=h2=0 


We will need a multivariate version of the Todd operator later, so we define 
for h = (hi, ha, sang hn), 
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m JO. 


Toddy := it 


oe 
j=l 1- exp (-#) 


keeping in mind that this is a product over infinite series of the form (12.3). 


12.2 A Continuous Version of Brion’s Theorem 


In the following two sections, we develop the tools that, once fused with the 
Todd operator, will enable us to extend Euler—Maclaurin summation to higher 
dimensions. We first give an integral analogue of Theorem 11.7 for simple 
rational polytopes. We begin by translating Brion’s integer-point transforms 


op(z) = >> ox, (2) 
v a vertex of P 
into an exponential form: 
op(expz) = > ox,(expz), 
v_ a vertex of P 


where we used the notation expz = (e*!,e”,...,e*4). For the continuous 
analogue of Brion’s theorem, we replace the sum on the left-hand side, 


op(expz) = S- (expz)™ = S- exp(m-z), 
mePnZ4 mePnZ4 
by an integral. 


Theorem 12.3 (Brion’s theorem: continuous form). Suppose P is a 
simple rational convex d-polytope. For each vertex cone Ky of P, fix a set of 


generators w1(v), Wo(v),...,Wa(v) € Z?. Then 
Hive = a4 exp (v - z) |det (wi(v),...,wa(v))| 
I, ° . va 2 of P Ts (Wr (v) : Z) 


for all z such that the denominators on the right-hand side do not vanish. 


Proof. We begin with the assumption that P is an integral polytope; we will 
see in the process of the proof that this assumption can be relaxed. Let’s 
write out the exponential form of Brion’s theorem (Theorem 11.7), using the 
assumption that the vertex cones are simplicial (because P is simple). By 
Theorem 3.5, 


S> expm-z) = SD Sep Se) Oke) (12.4) 


d 
mePnZ4 v a vertex of P he (1 — exp (Wi, (v) . Z)) 
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where 
ly = {Arwi(v) + A2gwa(v) +--+: +Aawa(v) : O< AL, A2,.--,Aa < 1} 


is the fundamental parallelepiped of the vertex cone Ky. We would like to 
rewrite (12.4) with the lattice Z4 replaced by the refined lattice (4z)°, because 
then, the left-hand side of (12.4) will give rise to the sought-after integral by 
letting n approach infinity. The right-hand side of (12.4) changes accordingly; 
now every integral point has to be scaled down by 1: 


S> exp(m-z) = exp(v - 2) Dimemynze exP (7 * 2) 


mePn(4z)* va vertex of P aa (1 — exp (mae . z) ) 
(12.5) 
The proof of this identity is in essence the same as that of Theorem 3.5; we 
leave it as Exercise 12.2. Now our sought-after integral is 


1 
[exw z)dx = lim — s exp(m - z) 
P n> 


oo nd 
mePn(2z)* 
2. ok exp(V-Z) )imemynzé —- eg z) 
=Jimo 
eee va vertex of P Th 1 (1 — exp (ae wt 2) 


(12.6) 


At this point, we can see that our assumption that P has integral vertices 
can be relaxed to the rational case, since we may compute the limit only for 
n’s that are multiples of the denominator of P. The numerators of the terms 
on the right-hand side have a simple limit: 


Jim, exp(v - Z) S- exp (= . 2) = exp(v- z) yy 1 


meéll,nZ4 meéll,nZ4¢ 


= exp (v - z) |det (wi(v),...,wa(v))|, 


where the last identity follows from Lemma 9.2. Hence (12.6) simplifies to 


ee exp (v - z) |det (wi(v),...,wa(v))| 
[exe \d ye 


d 4 ; : 
v a vertex of P Tee limy—so0 n (1 — exp (ue : z) ) 


Finally, using L’Hopital’s rule, 


Jim_n (1 — exp (me -z)) = —w;,(v)-z, 


and the theorem follows. 
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It turns out (Exercise 12.6) that for each vertex cone Ky, 


7 sipGe<a emi) exp (v- Z) [et Wa V)akees'Wat¥))| (12.7) 
K 


v [par (we(v) + 2) 


and Theorem 12.3 shows that the Fourier—Laplace transform of P equals the 
sum of the Fourier—Laplace transforms of the vertex cones. In other words, 


I exp(x- z) dx = ‘> [. exp(x-z) dx. 


v a vertex of P 


We also remark that |det (w)(v),...,wa(v))| has a geometric meaning: it 
is the volume of the fundamental parallelepiped of the vertex cone Ky. 

The curious reader might wonder what happens to the statement of Theo- 
rem 12.3 if we scale each of the generators w;,(v) by a different factor. It is 
immediate (Exercise 12.7) that the right-hand side of Theorem 12.3 remains 
invariant. 

There is an important difference between the vertex cone generating func- 
tions (integrals) that appear in the continuous version of Brion’s theorem 
(Theorem 12.3) and the vertex cone generating functions (sums) that appear 
in the discrete Brion theorem (Theorem 11.7). To see the difference, consider 
the following example: 

Let Ko be the first quadrant in R?, having generators (1,0) and (0,1). 
Let Ky be the cone defined as the nonnegative real span of (1,0) and (1, k). 
For k = 2! say, we see that for all practical purposes, K, is very close to 
Ko in its geometry, in the sense that their angles are almost the same for 
computational purposes, and thus their continuous Brion generating functions 
are almost the same, computationally. 

However, ox, (z) is quite far from ox, (z), since the latter has 21°° terms 
in its numerator, while the former has only 1 as its trivial numerator. Thus, 
tangent cones that are “arbitrarily close” geometrically may simultaneously 
be “arbitrarily far” from each other in the discrete sense dictated by the 
integer points in their fundamental domains. 


12.3 Polytopes Have Their Moments 


The most common notion for moments of a set P C R? is the collection of 
monomial moments defined by the real numbers 


aim f ytay = f tty? --uit dy, (12.8) 
P P 


for a fixed vector a = (a1,a@2,...,aq) € C%. For example, when a = 0 = 
(0,0,...,0), we get Wo = volP. As an application of moments, consider the 
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problem of finding the center of mass of P, which is defined by 


1 
[indy fray. f yady . 
vol P P P P 


This integral is equal to 


1 
ae (1(1,0,0,...,0)+ HM(0,1,0,...,0)0° °° , H(0,...,0,1)) : 
Lo 


Similarly, one can define the variance of P and other statistical data attached 
to P and use moments to compute them. 

Our next task is to relate the monomial moments [a to the formula of 
Theorem 12.3. One way to do this is to differentiate the continuous Brion 
identity in Theorem 12.3 repeatedly with respect to each z;, and in fact, 
interchanging these derivatives with the integral is valid because we are 
integrating over a compact region P. Thus, for a simple rational polytope P, 


a a a 
wT ae? saa? exp(e Zz) dx 
P 


= (8) (B) een 


Leyes ee ay exp (v2) [det (wi(v),-.-,walv))I 


d 
v a vertex of P Rie (wiv) : Z) 


If we take the limit as each z; — 0 of both sides of the latter identity, we have 
a formula (albeit a messy one) for the monomial moments: 


(—1)"ta = 
lim 3 (Z ye & ie exp (v - z) |det (wi(v),..., wa(v))| . 
Z— Zy 2d 


d 
va vertex of P Teer (wi, (v) . Z) 


This formula for the monomial moments may be used effectively for rela- 
tively small integer values of the exponent a = (aj, d2,...,@a). Going a step 
further, we can use Theorem 12.3 to obtain information about a different set 
of moments, which are sometimes called axial moments. Along the way, we 
stumble upon the following amazing formula for the continuous volume of a 
polytope. 


Theorem 12.4. Suppose P is a simple rational convex d-polytope. For each 
verter cone Ky of P, fix a set of generators wi(v),wa(v),...,wa(v) € Z¢. 
Then 


(—1)¢ > (v -z)“ |det (wi(v),...,wa(v))| 


d 
v a vertex of P bees (Wi, (v) . Z) 
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for all z such that the denominators on the right-hand side do not vanish. 
More generally, for every integer j > 0, 


aie Oo (v +z)’ *“ |det (wi(v),..., wa(v))| 
[i dx = 3 | 


(J + d)! v a vertex of P Ts (Wi, (v) . Z) 


Proof. We replace the variable z in the identity of Theorem 12.3 by sz, where 
5 is a scalar: 


exote<(aoiige = (14 exp (v - (sz)) |det (wi(v),...,wa(v))| 
[ewe (om)ax=(-y? 


v a vertex of P io (Wr (v) : (sz)) 


? 


which can be rewritten as 


exp (s (x -z)) dx = (—1)4 exp (s(v -z)) |det (wi(v),..., wa(v))| 
[explo ae= (1 | 


v a vertex of P st He4 (wx (v) : (z)) 


The general statement of the theorem follows now by first expanding the 
exponential functions as Taylor series in s, and then comparing coefficients 
on both sides: 


yi (+2)? dx * 
j>07P J: 


, gid Wi(V),..-,Walv 
(aj 58 Soe Ga el) 


7 d 
v a vertex of P j7>0 J Tat (wi(v) : (z)) 


=SYi-1* S&S (v-z)’*4 |det (wi(v),...,wa(v))| 8! 


j2—d v a vertex of P les (w(v) . (z)) Gi + d)! , 


The proof of this theorem reveals yet more identities between rational 
functions. Namely, the coefficients of the negative powers of s in the last line 
of the proof have to be zero. This immediately yields the following curious 
set of d identities for simple d-polytopes: 


Corollary 12.5. Suppose P is a simple rational convex d-polytope. For each 
vertex cone Ky of P, fix a set of generators wi(v),Wo(v),...,wa(v) € Z?. 
Then for eachO <j <d—-1, 


3 (v -z)’ |det (wi(v),..., wa(v))| 


v a vertex of P Wei (wx (v) : (z)) 
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12.4 Computing the Discrete Continuously 


In this section, we apply the Todd operator to a perturbation of the continuous 
volume. Namely, consider a simple full-dimensional polytope P, which we 
may write as 


P={xeR?: Ax<b}. 
Then we define the perturbed polytope 
P(h) :-= {xe R?: Ax <b+h} 


for a small vector h € R™ (we will quantify the word small in a moment). A 
famous theorem due to Askold Khovanskii and Aleksandr Pukhlikov says that 
the integer-point count in P can be obtained by applying the Todd operator 
to vol(P(h)). Here we prove the theorem for a certain class of polytopes, 
which we need to define first. 

We call a rational pointed d-cone unimodular if its generators are a basis 
of Z?. An integral polytope is unimodular if each of its vertex cones is 
unimodular.! 


Theorem 12.6 (Khovanskii—Pukhlikov theorem). For a unimodular d- 
polytope P, 
#(P AZ") = Toddp vol (P(h))|,~9 - 


More generally, 


op(expz) = Toddn [ oe -Z) dx 
P(h 


h=0 


Proof. We use Theorem 12.3, the continuous version of Brion’s theorem; note 
that if P is unimodular, then P is automatically simple. For each vertex 
cone Ky of P, denote its generators by w1(v), w2(v),.-.,Wa(v) € Z?. Then 
Theorem 12.3 states that 


exp(x-z)dx — (—1)4 exp (v - z) |det (wi(v),..., wa(v))| 
I, a va Pa of P [3 (wi. (v) : Z) 


=(-17 §O cca a (12.9) 


d 
va vertex of P Hees (wiv) : Z) 


where the last identity follows from Exercise 12.3. A similar formula holds 
for P(h), except that we have to account for the shift of the vertices. The 
vector h shifts the facet-defining hyperplanes. This shift of the facets induces 
a shift of the vertices; let’s say that the vertex v gets moved along each edge 
direction w;, (the vectors that generate the vertex cone K,,) by hy(v), so that 


! Unimodular polytopes go by two additional names, namely smooth and Delzant. 
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P(h) has now the vertex v — seme he(v)w;,(v). If h is small enough, P(h) 
will still be simple,” and we can apply Theorem 12.3 to P(h): 


exp ((v - > hi(v)we(v)) . 2) 


__4)4 
[orem 1) ba 


wauenole Tas (we(v) 2) 
“ye exp (v2 — Df he(v)walv) 2) 
enact TIeas (we(v) 2) 
=e yy ete) Ter xm (ha v)wra(v) 9) 


d 
va vertex of P Hai (wi. (v) : Z) 


Strictly speaking, this formula holds only for h € Q™, so that the vertices 
of P(h) are rational. Since we will eventually set h = 0, this is a harmless 
restriction. Now we apply the Todd operator: 


Toddy [ exp(x - z) dx 
P(h) 


h=0 


=(-18 SD Todd, SP) Mies exp (hu (v)we(v) -2) 


d 
v vertex of P Tied (wi, (v) . Z) 


h=0 
= (-1)4 Ss ene) 
v vertex of P Wes (wiv) . Z) 
d 
x |] Todd,,,...) exp (—he(v)we(v) - 2) 
k=l he(v)=0 


By a multivariate version of Lemma 12.1, 


Toddy [ exp(x - z) dx 
P(h) 


h=0 


d 
= Gif \ exp (v -z) Il —wrz(v)-z 


v vertex of P Te (wi(v) , z) k=1 t= exp(wz(v) ; z) 


d 

1 

— exp (v - z) : 
Vv 2 of P Il _ exp(wx(v) ; z) 

However, Brion’s theorem (Theorem 11.7), together with the fact that P is 

unimodular, says that the right-hand side of this last formula is precisely the 

integer-point transform of P (see also (12.9)), and thus 


? The cautious reader may consult [258, p. 66] to confirm this fact. 
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Toddy | exp(x - z) dx 
P(h) 


Finally, setting z = 0 gives 


Todd, i dx 
P(h) 


h=0 mePnZ4 


as claimed. 


We note that J, (h) exp(x- z) dx is, by definition, the continuous Fourier— 
Laplace transform of P(h). Upon being acted on by the discretizing operator 
Toddy, the integral Spun) exp(x - z) dx gives us the discrete integer-point 
transform op(z). 


Notes 


1. The classical Euler—Maclaurin formula states that 


=f ‘i hie £(0 mie HO + iw) > oan [Fe (a)] 


n 


m=1 ® 


1 n 

— B (2p+1) (4) 

+ apa [Boon ab fo" (@) dr, 

where B;,(2) denotes the k'* Bernoulli polynomial. It was discovered indepen- 
dently by Leonhard Euler and Colin Maclaurin (1698-1746).? This formula 
provides an explicit error term, whereas Theorem 12.2 provides a summation 
formula with no error term. 


2. The Todd operator was introduced by Friedrich Hirzebruch in the 1950s 
[138], following a more complicated definition by John A. Todd [243,244] some 
twenty years earlier. The Khovanskii-Pukhlikov theorem (Theorem 12.6) can 
be interpreted as a combinatorial analogue of the algebrogeometric Hirzebruch— 
Riemann—Roch theorem, in which the Todd operator plays a prominent role. 


3. Theorem 12.3, the continuous form of Brion’s theorem, was generalized by 
Alexander Barvinok to every polytope [19]. In fact, [19] contains a certain 
extension of Brion’s theorem to irrational polytopes as well. The decomposition 
formula for moments of a polytope in Theorem 12.4 is due to Michel Brion 
and Michéle Vergne [67]. 


3 For more information about Maclaurin, see 
http://www-history.mcs.st-andrews.ac.uk/Biographies/Maclaurin.html. 
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A, A natural question arises regarding moments of a given polytope, namely 
the following inverse problem: how many axial moments of a polytope P 
do we need to know in order to reconstruct the vertices of P? A solution 
was given in [122] recently, and it turns out to be at most O(dN), where 
d= dim(P) and N is the number of vertices of P. Practical implementations 
of this algorithm, which find applications in computer graphics and computer 
vision, are still being developed. A nice resource for related moment problems 
is the book [158). 


5. Theorem 12.6 was first proved in 1992 by Askold Khovanskii and Aleksandr 
Pukhlikov [147]. The proof we give here is essentially theirs. Their paper [147] 
also draws parallels between toric varieties and lattice polytopes. Subsequently, 
many attempts to provide formulas for Ehrhart quasipolynomials—some 
based on Theorem 12.6—have provided fertile ground for deeper connections 
and future work; a long but by no means complete list of references is 
[15, 16, 50, 67, 79, 85, 86, 106, 127, 145, 146, 159, 168, 182, 193, 240]. 


Exercises 


12.1. & Show that Todd, h =h+ 5. More generally, prove that 
Todd, h*® = B,(h +1) 


for k > 1, where B;,(a) denotes the k** Bernoulli polynomial. Thus, the Todd 
operators take the usual monomial basis to the basis of Bernoulli polynomials. 


12.2. & Prove (12.5): Suppose P is a simple integral d-polytope. For each 
vertex cone Ky of P, denote its generators by wi(v), wa(v),...,wa(v) € Z4 
and its fundamental parallelepiped by II. Then 


» exp(m-z) = se exp(V: 2) Dmenynad xP (7 - 2) 


d . 
mePn(2z)* vaverere J [pe4 (1-exp (- -2)) 
12.3. & Given a unimodular cone 
K = {v + Arwi + Agwe +--+ + Aawa: Migdiuaces > Oo}, 


where v,W1,W2,...,wq € Z? such that w1,wo,...,wg are a basis for Z?, 
show that 


YA 


and |det (w1,...,wa)| = 1. 
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12.4. Compute the integral (Fourier—Laplace transform) given by the formula 
of Theorem 12.4 for the standard simplex in R®, 


A = {(21,22...,24) ER: ay tag+-+:+¢q<landall 2, >0}. 


12.5. Using the previous exercise, compute the center of mass of the standard 
simplex in R?. 


12.6. & Prove (12.7). That is, for the simplicial cone 


d 
ba eh Soham: 201 
k=1 


with v,w1,W2,...,Wa € Q%, show that 


a &Xp (v- z) |det (wi(v),...,wa(v))| 
TIi=1 (we(v) - 2) 


12.7. Show that in the statement of Theorem 12.3, the expression 


[ exvte-2) dx = (—1) 


|det (wi(v),...,Wa(v))| 
Tai (we(v) +2) 


remains invariant on scaling each wz(v) by an independent positive integer. 


12.8. The binomial transform of a given sequence (a,,)°2, is the sequence 
by = yea dx. Prove that the Bernoulli numbers B,, are self-dual 
with respect to this transform: 


Be Si-t (;) By. 


12.9. Prove that if P is an integral d-polytope and f(a1,22,...,va) is a 
polynomial in d variables, then the following expression is a polynomial in t: 


Lp s(t):= > flm). 


méetPNZ4 


Open Problems 


12.10. Find all differentiable eigenfunctions of the Todd operator. 


12.11. (a) Classify all polytopes whose discrete and continuous volumes coin- 
cide, that is, Lp(1) = vol(P). 
(b) Classify all (not necessarily closed) polytopes for which Lp(t) = vol(P) t?. 


Chapter 13 
Solid Angles 


Everything you’ve learned in school as “obvious” becomes less and less obvious as you 
begin to study the universe. For example, there are no solids in the universe. There’s not 
even a suggestion of a solid. There are no absolute continuums. There are no surfaces. 
There are no straight lines. 


Buckminster Fuller (1895-1983) 


The natural generalization of a 2-dimensional angle to higher dimensions is 
called a solid angle. Given a pointed cone K C R%, the solid angle at its apex 
is the proportion of space that the cone K occupies. In slightly different words, 
if we pick a point x € R? “at random,” then the probability that x € K is 
precisely the solid angle at the apex of K. Yet another view of solid angles is 
that they are in fact volumes of spherical polytopes: the region of intersection 
of a cone with a sphere. There is a theory here that parallels the Ehrhart 
theory of Chapters 3 and 4, but which has some genuinely new ideas. 


13.1 A New Discrete Volume Using Solid Angles 


Suppose P C R? is a convex rational d-polyhedron. The solid angle wp (x) 
of a point x (with respect to P) is a real number equal to the proportion of a 
small ball centered at x that is contained in P. That is, we let B.(x) denote 
the ball of radius € centered at x and define 


__ vol (B(x) NP) 
Be Ge 
for all positive € sufficiently small; this notion is depicted in Figure 13.1. We 
note that when x ¢ P, wp(x) = 0; when x € P°, wp(x) = 1; when x € OP, 
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0 < wp(x) < 1. The solid angle of a face ¥ of P is defined by choosing a 
point x in the relative interior F° and setting wp(F) = wp(x). 


Fig. 13.1 A solid angle. 


Example 13.1. We compute the solid angles of the faces belonging to the 
standard 3-simplex A = conv {(0, 0,0), (1, 0,0), (0, 1,0), (0,0,1)}. As we just 
mentioned, a point interior to A has solid angle 1. Every facet has solid angle 
4 (and this remains true for every polytope). 

The story gets interesting with the edges: here we are computing dihedral 
angles. The dihedral angle of a 1-dimensional edge is defined by the angle 
between the outward-pointing normal to one of its defining facets and the 
inward-pointing normal to its other defining facet. 


Fig. 13.2 The simplex A 
from Example 13.1. 


Each of the edges OA, OB, and OC in Figure 13.2 has the same solid 
angle i. Turning to the edge AB, we compute the angle between its defining 
facets as follows: 
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5 cos! (= (-1,-1,-1)- (0,0,-1)) = 5 cos! (=) 


The edges AC and BC have the same solid angle by symmetry. 

Finally, we compute the solid angle of the vertices: the origin has solid 
angle 3 and the other three vertices all have the same solid angle w. With 
Corollary 13.9 below (the Brianchon-Gram relation), we can compute this 
angle via 


1.9. a 1 
= > C1)? wp(F) = aa 3- 3-— cos? (a) + ge. 
FCP 


ee ae ee 6 ee ee 
which gives w = 5= cos (=) q- 


We now introduce another measure of discrete volume; namely, we let 
Ap(t):= >> wip(m), 
meétPNnZ4 


the sum of the solid angles at all integer points in ¢P; recalling that wp(x) = 0 
if x ¢ P, we can also write 


Ap(t)= S > wep(m). 
meZé 


This new discrete volume measure differs in a substantial way from the Ehrhart 
counting function Lp(t). Namely, suppose P is a d-polytope that can be 
written as the union of the polytopes P; and Pz such that dim (P, MP2) < d, 
that is, P; and P2 are glued along a lower-dimensional subset. Then at each 
lattice point m € Z“, we have wp, (m)+wp,(m) = wp(m), and so the function 
Ap has an additive property: 


Ap(t) = Ap, (t) + Ap, (t)- (13.1) 
In contrast, the Ehrhart counting functions satisfy 
Lp (t) = Lp, (t) + Lp, (t) ~ Lp,nPs (t) : 


On the other hand, we can transfer computational effort from the Ehrhart 
counting functions to the solid-angle sum and vice versa, with the use of the 
following lemma. 


Lemma 13.2. Let P be a polytope. Then 


Ap(t) = S> wp(F) Lro(t). 


FOP 
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Proof. The dilated polytope tP is the disjoint union of its relative open faces 
tF° (Exercise 5.3), so that we can write 


Ap(t) = ba wep (m =e y, wp (m) 1,7>(m) . 
meZ?4 FCP meZz4 


But w;p(m) is constant on each relatively open face t¥°, and we called this 
constant wp(F), whence 


Ap(t)= S° we(F) S> lixe(m) = $0 wp(F) Lee(t)- 


FCP meZ¢é FCP 


Thus Ap(t) is a polynomial (respectively quasipolynomial) in t for an 
integral (respectively rational) polytope P. We claim that Lemma 13.2 is in 
fact useful in practice. To drive the point home, we illustrate this identity by 
computing the solid-angle sum over all integer points of A in Example 13.1. 


Example 13.3. We continue the solid-angle computation for the 3-simplex 
A= conv {(0, 0,0), (1,0, 0), (0,1, 0), (0,0, 1)}. We recall from Section 2.3 that 
Lyao = CS ) The facets of A are three standard triangles and one triangle 
that appeared in the context of the Frobenius problem. All four facets have 
the same interior Ehrhart polynomial are? A similar phenomenon holds for 
the edges of A: all six of them have the same interior Ehrhart polynomial 
t—1. These polynomials add up, by Lemma 13.2 and Example 13.1, to the 
solid-angle sum 


Aat)= ("5") +4-3(°5') (3-7 3- = cos”! (=))u-» 
A 


ee: ages = oe t 
6 Qn gay da 


The magic cancellation of the even powers of this polynomial is not a coinci- 
dence, as we will discover in Theorem 13.7. The curious reader may notice 
that the coefficient of t in this example is not a rational number, in stark 
contrast with Ehrhart polynomials. 


The analogue of Ehrhart’s theorem (Theorem 3.23) in the world of solid 
angles is as follows. 


Theorem 13.4 (Macdonald’s theorem). Suppose P is a rational convex 
d-polytope. Then Ap is a quasipolynomial of degree d whose leading coefficient 
is volP and whose period divides the denominator of P. 


Proof. The denominator of a face F C P divides the denominator of P, and 
hence so does the period of L+, by Ehrhart’s theorem (Theorem 3.23). B 
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Lemma 13.2, Ap is a quasipolynomial with period dividing the denominator 
of P. The leading term of Ap equals the leading term of Lp, which is vol P, 
by Corollary 3.20 and its extension in Exercise 3.34. 


13.2 Solid-Angle Generating Functions and a 
Brion-Type Theorem 


By analogy with the integer-point transform of a polyhedron P C R?, which 
lists all lattice points in P, we form the solid-angle generating function 


ap(z):= > wp(m)2™. 


meEPnZ4 


Using the same reasoning as in (13.1) for Ap, this function satisfies a nice 
additivity relation. Namely, if the d-polyhedron P equals P, U P2, where 
dim (PN P2) < d, then 


ap(z) = ap, (z) + ap,(z). (13.2) 


The solid-angle generating function obeys the following reciprocity relation, 
which parallels both the statement and proof of Theorem 4.3: 


Theorem 13.5. Suppose K is a rational pointed d-cone with the origin as 
apex, and v € R¢. Then the solid-angle generating function ay+x(z) of the 
pointed d-cone v + K is a rational function that satisfies 


aver (2) = (1) avse (2) 


Proof. Because solid angles are additive by (13.2), it suffices to prove this 
theorem for simplicial cones. The proof for this case proceeds along the same 
lines as the proof of Theorem 4.2; the main geometric ingredient is Exercise 4.2. 
We invite the reader to finish the proof (Exercise 13.6). 


The analogue of Brion’s theorem in terms of solid angles is as follows. 


Theorem 13.6. Suppose P is a rational convex polytope. Then we have the 
following identity of rational functions: 


ap(z) = > ak, (Z) . 


v a vertex of P 


Proof. As in the proof of Theorem 11.7, it suffices to prove Theorem 13.6 for 
simplices. So let A be a rational simplex. We write A as the disjoint union of 
its open faces and use Brion’s theorem for open polytopes (Exercise 11.9) on 
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each face. That is, if we denote the vertex cone of F at vertex v by K,(F), 
then by a monomial version of Lemma 13.2, 


aa(z) = D7 wa(F) oFe(z) 


FoL 
= x wa(v) ZY + wa(F) Va OK, (F)°(Z) ; 
v a vertex of A Fee va vertex of F 
dim F>0 


where we used Brion’s theorem for open polytopes (Exercise 11.9) in the 
second step. By Exercise 13.7, 


> walF) SS one) = SY YS ux,(Fore(2), 


FCA va vertex of F va vertex of A FCKy, 
dim F>0 dim F>0 
and so 
Vv 
aa(Z) = y wa(v) ZY + ) ) wx, (F) oF (Z) 
va vertex of A va vertex of A FCKy 
dim F>0 


l| 


S> wey (F) oFe (2) 


va vertex of A FCKy 


> a, (Z) . 


va vertex of A 


I 


13.3 Solid-Angle Reciprocity and the Brianchon—Gram 
Relations 


With the help of Theorems 13.5 and 13.6, we can now prove the solid-angle 
analogue of Ehrhart—Macdonald reciprocity (Theorem 4.1): 


Theorem 13.7 (Macdonald’s reciprocity theorem). Suppose P is a ra- 
tional convex polytope. Then the quasipolynomial Ap satisfies 


Ap(—t) = (-1)%™? Ap(t). 


Proof. We give the proof for an integral polytope P and invite the reader to 
generalize it to the rational case. The solid-angle counting function of P can 
be computed through the solid-angle generating function of tP: 


Ap(t) _ arp (1, I teeny 1) = lim arp (Z) : 
zl 


By Theorem 13.6, 


13.3 Solid-Angle Reciprocity and the Brianchon—Gram Relations 233 


Ap(t) = lim S> aux, (2), 
v a vertex of P 


where Ky is the tangent cone of P at the vertex v. We write Ky =v +K(v), 
where K(v) := Ky — v is a rational cone with the origin as its apex. Then 
tK, =tv+K(v), because a cone whose apex is the origin does not change 
under dilation. Hence we obtain, with the help of Exercise 13.5, 


Ap(t) = lim a z) = lim z'%a Z). 
as. va 2» of P eee a= va & of P ce 
The rational functions ax,,)(z) on the right-hand side do not depend on t. If 
we think of the sum over all vertices as one big rational function, to which we 
apply L’H6pital’s rule to compute the limit as z > 1, this gives an alternative 
proof that Ap(t) is a polynomial, in line with our proof for the polynomiality 
of Lp(t) in Section 11.5. At the same time, this means that we can view the 
identity 
Ap(t)=lim Stax (2) 


zl 
v a vertex of P 


in a purely algebraic fashion: on the left-hand side, we have a polynomial 
that makes sense for every complex t, and on the right-hand side, we have a 
rational function of z, whose limit we compute, for example, by L’H6pital’s 
rule. So the right-hand side, as a function of t, makes sense for every integer 
t. Hence we have the algebraic relation, for integral t, 


ion =]; —tv 
Ap(—t) = lim » aK (v)(Z) - 


But now by Theorem 13.5, axy)(z) = (—1)4 axiy) (4), and so 


1 
3) = Ti tv / _7\2 
Ap(—t) jim > ae (-)) axe ( ) 


=(-Hfim ey axw (=) 


v a vertex of P 


: 1 
= (-1)" im x Otv+K(v) (z) 


v a vertex of P 


In the third step, we used Exercise 13.5 again. 
This proves Theorem 13.7 for integral polytopes. The proof for rational 
polytopes follows along the same lines; one deals with rational vertices in the 
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same manner as in our second proof of Ehrhart’s theorem in Section 11.5. We 
invite the reader to finish the details in Exercise 13.8. 


We remark that throughout the proof, we cannot simply take the limit 
inside the finite sum over the vertices of P, since z = 1 is a pole of each 
rational function axy). It is precisely the magic of Brion’s theorem that 
makes these poles cancel each other, to yield Ap(t). 

If P is an integral polytope, then Ap is a polynomial, and Theorem 13.7 
tells us that Ap is always even or odd: 


Ap(t) =Cq i Cq_ott-? ++ +9. 
We can say more. 
Theorem 13.8. Suppose P is a rational convex polytope. Then Ap(0) = 0. 


This is a meaningful zero. We note that the constant term of Ap is given by 


Ap(0) = >> wp(F) Lre(0) = S> we(F)(-)™%, 


FCP FCP 


by Lemma 13.2 and Ehrhart—Macdonald reciprocity (Theorem 4.1). Hence 
Theorem 13.8 implies a classical and useful geometric identity: 


Corollary 13.9 (Brianchon—Gram relation). For a rational convex poly- 
tope P, 


S> (-1)9™* wp(F) = 0. 


FCP 


Example 13.10. Consider a triangle J in R? with vertices v1, v2,v3 and 
edges E), E2, F3. The Brianchon—Gram relation tells us that for this triangle, 


wr (v1) + wr (v2) +wr (vs) — wr (E1) + wr (E2) + wr (E3)) + wr (T) = 0. 
Since the solid angles of the edges are all 5 and wz (7) = 1, we recover our 


friendly high-school identity the sum of the angles in a triangle is 180 degrees: 


wr (v1) +47 (v2) + wr (vs) = 5 


Thus the Brianchon—Gram relation is the extension of this well-known fact to 
every dimension and every convex polytope. 


Proof of Theorem 13.8. It suffices to prove A,a(0) = 0 for a rational simplex 
A, since solid angles of a triangulation simply add, by (13.1). Theorem 13.7 
gives A, (0) = 0 if dimA is odd. 

So now suppose A is a rational d-simplex, where d is even, with vertices 
V1,V2,---,Va+1- Let P(n) be the (d+ 1)-dimensional pyramid that we obtain 


13.3 Solid-Angle Reciprocity and the Brianchon—Gram Relations 235 


Fig. 13.3 The pyramid 
P(n) for a triangle A. 


by taking the convex hull of (v1, 0), (v2,0),...,(va4i,0), and (0,0,...,0,7), 
where n is a positive integer (see Figure 13.3). Note that since d+ 1 is odd, 


Apny(0) = Sd) (D984 wpiny(F(n)) = 0. 
F(n)CP(n) 


We will conclude from this identity that 0c, (—1)4™7 wa (F) = 0, which 
implies that Aa (0) = 0. To this end, we consider two types of faces of P(n): 


(a) those that are also faces of A; 
(b) those that are not contained in A. 


We begin with the latter: Aside from the vertex (0,0,...,0,n), every face 
F(n) of P(n) that is not a face of A is the pyramid over a face G of A; let’s 
denote this pyramid by Pyr (G,n). Further, as n grows, the solid angle of 
Pyr (G,n) (in P(n)) approaches the solid angle of G (in A): 


Jim, Wp(n) (Pyr (G,n)) = wa (G), 


since we are forming A x [0,00) in the limit. On the other hand, a face 
F(n) =G of P(n) that is also a face of A obeys the following limit behavior: 


1 
dim wp(n) (F(n)) = 5 wa (9). 


The only face of P(n) that we still have to account for is the vertex v := 
(0,0,...,0,n). Hence 


0= We (-1)8™ 7 wpm (F(n)) 
F(n)CP(n) 


GCA GCA 
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Now we take the limit as n — 00 on both sides; note that limn_,.. wp(n) (Vv) = 
0, so that we obtain 


0= es ao aed (G) +4 ba (—1)8 9 wn (G) 


GCA GCA 


= ; a Caen. (G) 


GCA 


and so 


Aa(0) = 52 (-1)%™Swa(G) = 0. 


GCA 


The combination of Theorems 13.7 and 13.8 implies that summing solid 
angles in a polygon is equivalent to computing its area: 


Corollary 13.11. Suppose P is a 2-dimensional integral polytope with area a. 
Then Ap(t) =at?. 


13.4 The Generating Function of Macdonald’s 
Solid-Angle Polynomials 


We conclude this chapter with the study of the solid-angle analogue of Ehrhart 
series. Given an integral polytope P, we define the solid-angle series of P as 
the generating function of the solid-angle polynomial, encoding the solid-angle 
sum over all dilates of P simultaneously: 


Solidp(z) = S> Ap(t) 2". 


t>0 


The following theorem is the solid-angle analogue to Theorems 3.12 and 4.4, 
with the added bonus that we get the palindromicity of the numerator of 
Solidp for free. 


Theorem 13.12. Suppose P is an integral d-polytope. Then Solidp is a ra- 
tional function of the form 


agz? + ag_127 1 +--+ +442 


Solidp (z) = fa _ 2 


Furthermore, we have the identity 
1 
Solidp (3) => (<1) Solidp(z), 
Zz 


or equivalently, a, = da4i-k for 1 <k< g. 
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Proof. The form of the rational function Solidp follows, by Lemma 3.9, from 
the fact that Ap is a polynomial. The palindromicity of a,,a2,...,aq is 
equivalent to the relation 


Solidp (<) = (—1)**" Solida(z), 


z 
which, in turn, follows from Theorem 13.7: 


Solidp(z) = $0 Ap(t) 2* = $0 (-1)*Ap(-t) 2* = (-1)4 > Ap(t) 2. 


t>0 t>0 t<0 


Now we use Exercise 4.7: 


(-1)4 0 Ap(t) 27 = (-1)41 SS Ap(t) 2 = (-1)" Solidp (+) 


t<0 t>1 


In the last step, we used the fact that Ap(0) = 0 (Theorem 13.8). 


Notes 


1. I. G. Macdonald inaugurated the systematic study of solid-angle sums 
in integral polytopes. The fundamental Theorems 13.4, 13.7, and 13.8 can 
be found in his 1971 paper [166]. The proof of Theorem 13.7 we give here 
follows [38]. 


2. The Brianchon-Gram relation (Corollary 13.9) is the solid-angle analogue 
of the Euler relation for face numbers (Theorem 5.2). The 2-dimensional case 
discussed in Example 13.10 is ancient; it was most certainly known to Euclid. 
The 3-dimensional case of Corollary 13.9 was discovered by Charles Julien 
Brianchon in 1837 and—as far as we know—was independently re-proved 
by Jérgen Gram in 1874 [121]. It is not clear who first proved the general 
d-dimensional case of Corollary 13.9. The oldest proofs we could find were 
from the 1960s, by Branko Griinbaum [126], Micha A. Perles, and Geoffrey C. 
Shephard [189, 218]. 


3. Theorem 13.5 is a particular case of a reciprocity relation for simple 
lattice-invariant valuations due to Peter McMullen [172], who also proved 
a parallel extension of Ehrhart-Macdonald reciprocity to general lattice- 
invariant valuations. There is a current resurgence of activity on solid angles; 
see, for example, [45, 76, 103, 115, 204]. 
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Exercises 


13.1. Compute Ap(t), where P is the regular tetrahedron with vertices 
(0,0,0), (1,1,0), (1,0,1), and (0,1,1) (see Exercise 2.13). 


13.2. Compute Ap(t), where P is the rational triangle with vertices (0,0), 
(5,4), and (1,0). 

13.3. For a simplex A, let S(A) denote the sum of the solid angles at the 
vertices of A. 


(a) Prove that S(A) < 5. 
(b) Construct a sequence A,, of simplices in a fixed dimension, such that 
limp—+oo S(An) = 5. 


13.4. Let Z be a d-dimensional integral zonotope. Show that Az(t) = 
vol(Z) t4. 


13.5. & Let K be a rational d-cone and let m € Z?. By analogy with Exer- 
cise 3.9, show that am+x(z) = 2™ ax (Zz). 


13.6. & Complete the proof of Theorem 13.5: For a rational pointed d-cone 
K, ax(z) is a rational function that satisfies 


ax (2) = (Dax (2). 


13.7. & Suppose A is a rational simplex. Prove that 


Yo waF) SD on) = > So ey (F) oFe (2). 
FCA v a vertex of F va vertex of A FCKy, 
dim F>0 dim F>0 


13.8. & Provide the details of the proof of Theorem 13.7 for rational polytopes: 
Prove that if P is a rational convex polytope, then the quasipolynomial Ap 


satisfies 
Ap(—t) = (-)™? Ap(é). 


13.9. Recall from Exercise 3.2 that to a permutation 7 € Sq on d elements, 
we can associate the simplex 


A, := conv {0, Crtijs On(iy + Cpajes s+ Oatiy + Oxayh + e,(a)} . 
Prove that for all  € S,, Aa, (t) = 7 t?. 


13.10. Give a direct proof of Corollary 13.11, e.g., using Pick’s theorem 
(Theorem 2.8). 


13.11. State and prove the analogue of Theorem 13.12 for rational polytopes. 
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Open Problems 


13.12. Study the roots of solid-angle polynomials. 


13.13. Classify all polytopes that have only rational solid angles at their 
vertices. 


13.14. Classify all rational polytopes P such that Ap(t) = vol(P) t?. 


13.15. Which integral polytopes P have an integral solid-angle sum? More 
generally, which integral polytopes P have solid-angle polynomials Ap(t) € 
Qt]? That is, for which integral polytopes P are all the coefficients of Ap(t) 
rational? (For one such class of polytopes, see [123].) 


Chapter 14 


A Discrete Version of Green’s 
Theorem Using Elliptic Functions 


The shortest route between two truths in the real domain passes through the complex do- 
main. 


Jacques Salomon Hadamard (1865-1963) 


We now allow ourselves the luxury of using basic complex analysis. In par- 
ticular, we assume that the reader is familiar with contour integration and 
the residue theorem. We may view the residue theorem as yet another result 
that intimately connects the continuous and the discrete: it transforms a 
continuous integral into a discrete sum of residues. 

Using the Weierstraf o- and ¢-functions, we show here that Pick’s theorem 
is a discrete version of Green’s theorem in the plane. As a bonus, we also 
obtain an integral formula (Theorem 14.5 below) for the discrepancy between 
the area enclosed by a general curve C' and the number of integer points 
contained in C. 


14.1 The Residue Theorem 


We begin this chapter by reviewing a few concepts from complex analysis. 
Suppose the complex-valued function f has an isolated singularity w € G; 
that is, there is an open set G C C such that f is analytic on G \ {w}. Then 
f can be expressed locally by the Laurent series 


fae hae (z-—w)", 


nezZ 


valid for all z € G\ {w}; here c, € C. The coefficient c_; is called the 
residue of f at w; we will denote it by Res(z = w). The reason to give c_; 
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a special name can be found in the following theorem. We call a function 
meromorphic if it is analytic in C with the exception of isolated poles. 


Theorem 14.1 (Residue theorem). Suppose f is meromorphic and C is 
a positively oriented, piecewise differentiable simple closed curve that does not 
pass through any pole of f. Then 


i — 2ni S” Res(z = w), 
Cc Ww 


where the sum is taken over all singularities w inside C. 


If f is a rational function, Theorem 14.1 gives the same result as a partial 
fraction expansion of f. We illustrate this philosophy by returning to the 
elementary beginnings of Chapter 1. 


Example 14.2. Recall our constant-term identity for the restricted partition 
function for A = {a1,a2,...,aa} in Chapter 1: 


pa(n) = const (7 = =o = or} : 


Computing the constant term of the Laurent series of (aah) (laa*ayar 
expanded about z = 0 is naturally equivalent to “shifting” this function by 
one exponent and computing the residue at z = 0 of the function 


1 
(1 — 2%) (1 — 2@)--- (1 — 2@e) grt : 


f@)i= 


Now let C;. be a positively oriented circle of radius r > 1, centered at the 
origin. The residue Res(z = 0) = pa(n) is one of the residues that are picked 
up by the integral 


ay a. f = Res(z y+ as 


where the sum is over all nonzero poles w of f that lie inside C;,. These 
poles are at the at®,at?,...,a't roots of unity. Moreover, with the help of 


Exercise 14.1, we can show that 


roo QTt C, 


| 
a 
B 
= 
| 
S 
+ 
aw 
B 
<° 
l 
= 
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where the sum extends over all at, at®,..., a? roots of unity. In other words, 


pa(n) = Res(z = 0) Sat =) 


To obtain the restricted partition function p,, it remains to compute the 
residues at the roots of unity, and we invite the reader to realize that this 
computation is equivalent to the partial fraction expansion of Chapter 1 
(Exercise 14.2). 


Analogous residue computations could replace any of the constant-term 
calculations that we performed in the earlier chapters. 


14.2 The WeierstraB o- and ¢-Functions 


The main character in our play is the Weierstra8 ¢-function, defined by 


1 1 1 z 
Ce Se S- (tea tae a). (14.1) 


(m,n)€Z?\ (0,0) 


This infinite sum converges uniformly on compact subsets of the lattice- 
punctured plane C \ Z? (Exercise 14.5), and hence forms a meromorphic 
function of z. 

The Weierstraf ¢-function possesses the following salient properties, which 
follow immediately from (14.1): 


(1) ¢ has a simple pole at every integer point m+ ni and is analytic elsewhere. 
(2) The residue of ¢ at each integer point m+ ni equals 1. 


We can easily check (Exercise 14.6) that 


(m,n)EZ?\ (0,0) a ni)) (m Li ni) 

(14.2) 
the Weierstra8 ¢-function. The g-function has a pole of order 2 at each 
integer point m+ ni and is analytic elsewhere, but has residue equal to zero at 
each integer point m+ ni. However, 9 possesses a very pleasant property that 
¢ does not: g is doubly periodic on C. We may state this more concretely: 


Lemma 14.3. (z+ 1) = 9(z +t) = 9(z). 


Proof. We first invite the reader to prove the following two properties of ! 
(Exercises 14.7 and 14.8): 
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gp’ (z+1)=9"(z), (14.3) 
Zz 
/ @! (z) dz is path-independent. (14.4) 
Zo 
By (14.3), 
d 
qe +1) — @(2)) = (2 +1) — wz) =0, 
so 9(z +1) — e(z) =c for some constant c. On the other hand, g is an even 
function (Exercise 14.9), and so z = —4 gives us 


c= (3) —»(-3) =9. 


This shows that g (z + 1) = e(z) for all z € C\ Z?. An analogous proof, which 
we invite the reader to construct in Exercise 14.10, shows that o (z + 7) = ¢(z). 


Lemma 14.3 implies that o(z + m+ ni) = 9(z) for all m,n € Z. The 
following lemma shows that the Weierstra8 ¢-function is only a conjugate- 
analytic term away from being doubly periodic. 


Lemma 14.4. There is a constant a such that the function ¢(z) + az is 
doubly periodic with periods 1 and i. 


Proof. We begin with w = m+ ni: 
m+ni 
G(e+mtni)—c2)=- f o(z+w)du, (14.5) 
w=0 


by definition of g(z) = —¢'(z). To make sure that (14.5) makes sense, we 
should also check that the definite integral in (14.5) is path-independent 
(Exercise 14.11). 

Due to the double periodicity of ¢, 


m+ni 1 ll 
/ o(e+w)dw=m f plettat+ni f go (z+ it) dt 
w=0 0 0 


= ma(2) + niB(2) 


where 


a(z) =| o(z+t) dt and B(z) =| o(z+ it) dt. 


Now we observe that a (z+ 29) = a(z) for every xp € R, so that a (a + iy) 
depends only on y. Similarly, 6 (a + iy) depends only on x. But 


¢ (z +m + in) — ¢(z) = — (may) + inB(2)) 
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must be analytic for all z € C \ Z?. If we now set m = 0, we conclude that 
B(x) must be analytic in C \ Z?, so that (x) must be a constant by the 
Cauchy—Riemann equations for analytic functions. Similarly, setting n = 0 
implies that a(y) is constant. Thus 


¢(z+m-+in) — ¢(z) = — (ma + in’) 


with constants a and 6. Returning to the Weierstraf o-function, we can 
integrate the identity (Exercise 14.12) 


(iz) = —p(z) (14.6) 
to obtain the relationship 3 = —a, since 
1 1 1 
0 0 0 
To summarize, 


¢(z+m+in) — ¢(z) =—ma + ina = -a(z+m+in—-Z), 


so that ¢(z) + aZ is doubly periodic. 


14.3 A Contour-Integral Extension of Pick’s Theorem 


For the remainder of this chapter, let C' be any piecewise-differentiable simple 
closed curve in the plane, with a counterclockwise parameterization. We let 
D denote the region that C contains in its interior. 


Theorem 14.5. Let C avoid every integer point, that is, CN Z? = @. Let 
I denote the number of integer points interior to C, and A the area of the 
region D enclosed by the curve C. Then 


1 


Qri 


i (C(z) —7zZ) dz =I-A. 
C 
Proof. We have 
[Ge +ende= f ce)deta f (iy) (ae tidy), 
Cc Cc c 
where a is as in Lemma 14.4. By Theorem 14.1, [.,¢(z) dz is equal to the 


sum of the residues of ¢ at all of its interior poles. There are I such poles, 
and each pole of ¢ has residue 1. Thus 
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= C(z)dz =I. (14.7) 
271 Io 


On the other hand, Green’s theorem tells us that 


[ @-iv) ae + iay) = ff (win) de + (y+ in) dy 
C 


Cc 
7) : ; 
=| 5, u+ia)— 5 (iy) 


= ff 2 


= 2iA. 
Returning to (14.7), we get 
i (Abas de = Sri eae), (14.8) 
Cc 
We have only to show that a = —7z. Consider the particular curve C' that is 


a square path, centered at the origin, traversing the origin counterclockwise 
and bounding a square of area 1. Thus J = 1 for this path. Since ¢(z) + az 
is doubly periodic by Lemma 14.4, the integral in (14.8) vanishes. We can 
conclude that 

0 = 27i-14+a(2i-1), 


so that a = —7. 


Notice that Theorem 14.5 has given us information about the Weierstra8 
¢-function, namely that a = —7. 

This chapter offers a detour into an infinite landscape of discrete results 
that meet their continuous counterparts. Equipped with the modest tools 
offered in this book, we hope that we have motivated the reader to explore 
this landscape further. 


Notes 


1. The Weierstra8 o-function, named after Karl Theodor Wilhelm Weier- 
straB (1815-1897),! can be extended to every 2-dimensional lattice L = 
{kw + jwo: k,j € Z} for some wi, w2 € C that are linearly independent 


over R: 


meL\{0} 


1 For more information about Weierstra8, see 
http: //www-history.mcs.st-andrews.ac.uk/Biographies/Weierstrass.html. 
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The Weierstra8 gc-function and its derivative o, satisfy a polynomial rela- 
tionship, namely, (CA = 4(gc)* — 92 Pc — gs for some constants gz and g3 
that depend on £. This is the beginning of a wonderful friendship between 
complex analysis and elliptic curves. 


2. Theorem 14.5 appeared in [105]. There it is also shown that one can recover 
Pick’s theorem (Theorem 2.8) from Theorem 14.5. 


Exercises 


14.1. & Show that for positive integers a,,dg,..-,@a,7, 


1 
li =0. 
reo Jo, (I= 2%) (1— 294) aH 


This computation shows that the integrand above “has no pole at infinity.” 


14.2. & Compute the residues at the nontrivial roots of unity of 


1 
zZ)= : 
ba ) (1 — 2%)--- (1 — 244) grtt 
For simplicity, you may assume that a ),qa2,...,@q are pairwise relatively 


prime. 


14.3. This exercise yields a proof of (8.4), namely, that the Bernoulli-Barnes 
polynomials B(x), defined in Chapter 8 through the generating function 
gd et2 


k 
=~ BA(z) — 
(rahe tera 2 FO) 


allow us to express the polynomial part of p4(n) as 


poly 4 (7) = C= 1)! d 


As above, we let 


1 
Zi) = 
f( ) (1 — 2%)--- (1 — 2%) grt 
and assume that a,,a2,...,@q are pairwise relatively prime. 


(a) Show that poly ,(m) equals the negative of the residue of f(z) at z = 1. 

(b) Deduce the above formula for poly ,(n) in terms of a Bernoulli-Barnes 
polynomial. (Hint: convince yourself that the residue of f(z) at z = 1 
equals the residue of e* f(e*) at z = 0.) 
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14.4. Give an integral version of Theorem 2.13. 
14.5. & Show that 


il 1 1 z 
(z= -+ ( ~ 4 24 :) 
*  (m,n)ez2\(0,0) \* (m+ni) m+ni  (m+ni) 


converges absolutely for z belonging to compact subsets of C \ Z?. 


14.6. & Prove (14.2), that is, 


t@=-4- ¥ ( ee 2) 


(m,n)€Z?\ (0,0) (z — (m+ nt)) (m + ni) 


14.7. & Prove (14.3), that is, show that 0! (z+ 1) = g'(z). 


14.8. & Prove (14.4), that is, show that for all zo, z1 € C \ Z?, the integral 
1 g!(w) dw is path-independent. 


14.9. & Show that ¢ is even, that is, 9 (—z) = 9(z). 
14.10. & Finish the proof of Lemma 14.3 by showing that g(z +7) = 9(z). 


14.11. & Prove that the integral in (14.5), 


w=m+ni 
C(e+m+ni)—¢@)=- f o(z+w) de, 


is path-independent. 


14.12. & Prove (14.6), that is, o(iz) = —e(z). 


Open Problems 


14.13. Can we get even more information about the WeierstraB g- and ¢- 
functions using more detailed knowledge of the discrepancy between J and A 
for special curves C'? 


14.14. Find a complex-analytic extension of Theorem 14.5 to higher dimen- 
sions. 


Appendix A 


Vertex and Hyperplane Descriptions of 
Polytopes 


Everything should be made as simple as possible, but not simpler. 


Albert Einstein 


In this appendix, we prove that every convex polytope has both a vertex and 
a hyperplane description. This appendix owes everything to Giinter Ziegler’s 
beautiful exposition in [258]; in fact, these pages contain merely a few cherries 
picked from [258, Lecture 1]. 

As in Chapter 3, it is easier to move to the world of cones. To be as concrete 
as possible, let’s call  C R?@ an h-cone if 


K={xeR?: Ax <0} 


for some A € R™*4; in this case, K is given as the intersection of m half-spaces 
determined by the rows of A. We use the notation K = hcone(A). 
On the other hand, we call K C R?@ a v-cone if 


K={By: y= 0} 


for some B € R@*”, that is, K is a pointed cone with the column vectors of 
B as generators. In this case, we use the notation K = vcone(B). 

Note that according to our definitions, every h-cone and every v-cone 
contains the origin in its apex. We will prove that every h-cone is a v-cone 
and vice versa. More precisely: 


Theorem A.1. For every A € R™*4, there exists B ¢ R&%*” (for some n) 
such that hcone(A) = vcone(B). Conversely, for every B € R?*", there exists 
AeER™*¢ (for some m) such that vcone(B) = hcone(A). 


We will prove the two halves of Theorem A.1 in Sections A.1 and A.2. For 
now, let’s record that Theorem A.1 implies our goal, that is, the equivalence 
of the vertex and half-space description of a polytope: 
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Corollary A.2. If P is the convex hull of finitely many points in R¢, then 
P is the intersection of finitely many half-spaces in R¢. Conversely, if P is 
given as the bounded intersection of finitely many half-spaces in R¢, then P 
is the convex hull of finitely many points in R¢. 


Proof. If P = conv {v1,V2,---;Vn} for some v1, V2,---,;Vn € R%, then coning 
over P (as defined in Chapter 3) gives 


cone(P) = vcone 6 - oe > : 
By Theorem A.1, we can find a matrix (A,b) € R™*<‘4 such that 
cone(P) = hcone(A,b) = {x € R“*1 : (A,b)x < 0}. 
We recover the polytope P on setting rg41 = 1, that is, 


P={xeR*: Ax<-b}, 


which is a hyperplane description of P. 
These steps can be reversed: Suppose the polytope P is given as 


P={xeR*’: Ax<—b} 
for some A € R™*4 and b € R™. Then P can be obtained from 
hcone(A,b) = {x € R¢*?: (A, b) x < 0} 


by setting xg,1 = 1. By Theorem A.1, we can construct a matrix B € R(t)" 
such that 
hcone(A, b) = vcone(B) . 


We may normalize the generators of vcone(B), that is, the columns of B, such 
that they all have their (d+ 1)** variable equal to 1: 


= Vi V2... Vn 
B= (Vt). 


Since P can be recovered from vcone(B) by setting zg+1 = 1, we conclude 
that P = conv {vj,V2,..-,Vn}- 


A.1 Every h-Cone Is a v-Cone 


Suppose 
K = hcone(A) = {x € R?:; Ax< o} 
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for some A € R™*4, We introduce an auxiliary m-dimensional variable y and 
write 


x={(9) ERY Ax<ybn{(*) e RU y=oh. (A.1) 


(Strictly speaking, this is K lifted into a d-dimensional subspace of R¢+™.) 
Our goal in this section is to prove the following two lemmas. 


Lemma A.3. The h-cone {() ERt™: Ax< y} is a v-cone. 


Lemma A.4. If K C R®@ is a v-cone, then so is KN {x €R?: a, = o}, for 
every k. 


The first half of Theorem A.1 follows with these two lemmas, since we can 
start with (A.1) and intersect with one hyperplane y; = 0 at a time. 


Proof of Lemma A.8. We begin by noting that 


c= {()earnsassy] 
“{() ex an(8) 9] 


is an h-cone; here I represents the m x m identity matrix. Let’s denote the 
k* unit vector by e,. Then we can decompose 


G)=Ea(xe) hm -xo(2) 


7 3 |xj| sign (x) & 7 3 el (2) 


Note that if G) € K, then y, —(Ax), > 0 for all k, and so (*) can be written 

as a nonnegative linear combination of the vectors sign (x;) (42), 1 <j <4, 
iz 

and (o), 1<k<m. But this means that K is a v-cone. 


Proof of Lemma A.4. Suppose K = vcone(B), where B has the column vec- 
tors b,,bo,...,b, € R%; that is, b,,b2,...,b, are the generators of K. Fix 
k <d and construct a new matrix By, whose column vectors are all the b; for 
which bj, = 0 together with the combinations b;,b; — b;,b; whenever bj, > 0 
and bj, <0. We claim that 


Kn {x € R*: x, = 0} = veone (Bx). 
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Every x € vcone(B,) satisfies x, = 0 by construction of By, and so 
veone (Bx) C KN {x € R*: 2, =0} follows immediately. We need to do 
some more work to prove the reverse containment. 

Suppose x € KN {x € R?: x, =O}, that is, x = Ayby + Agb2 +--+ +Anbn 
for some Ay, A2,...,An > 0 and ry = Aybi~ + Agbor +--+ + Anbdng = 0. This 
allows us to define 


So dabixn =— SO Agdge 
i: bin >O ji bye <0 
Note that A > 0. Now consider the decomposition 
S> Aybj + SO Abi + So Ajb;. (A.2) 
j: bj ~=0 i: bip >O J: b5%<O 


If A = 0, then \;b;, = 0 for all i such that b;, > 0, and so A; = O for those i. 
Similarly, A; = 0 for all 7 such that bj, <0. Thus we conclude from A = 0 
that 
ys Ajb; € vcone (Bx). 
j: bjr=0 


Now assume A > 0. Then we can expand the decomposition (A.2) into 


= © vb+5 — So Ajdyx & vn 


9: 65,=0 Jj: bjr<O i: bin >O 
+ iL S wa] Y> Ab; 
a: bin >O ji bjn<O 
= S> A;by+ ; > Aids (binby — Byxbs), 
Jj: b5~=0 a: big, >O 
J: b5%<O 


which is by construction in vcone (Bx). 


A.2 Every v-Cone Is an h-Cone 


Suppose 
K = vcone(B) = {By: y > 0} 


for some B € R?*”. Then K is the projection of 


{(*) ERM, y>0,x=By} (A.3) 
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to the subspace {() ERM: y= of. The constraints for (A.3) can be 


written as 


y 


Thus the set (A.3) is an h-cone, for which we can project one component of 
y at a time to obtain K. This means that it suffices to prove the following 
lemma to finish the second half of Theorem A.1. 


y>0 and (I, —B) (*) ii, 


Lemma A.5. If K is an h-cone, then the projection {x — xpex : x € K} is 
also an h-cone, for every k. 


Proof. Suppose K = hcone(A) for some A € R™™*?. Fix k and consider 
Pr={x+Ae,n: xEK, AER}. 
The projection we are after can be constructed from this set as 
{x — apen: XE K}=PpN {xe R*: 2, =0}, 


so that it suffices to prove that P, is an h-cone. 

Suppose a1, a2,..., A, are the row vectors of A. We construct a new matrix 
A, whose row vectors are all a; for which aj; = 0, and the combinations 
Ajpaj; — A;,a; Whenever a;, > 0 and aj, <0. We claim that P, = hcone (A,). 

If x € K, then Ax < 0, which implies A; x < 0 because each row of Ax 
is a nonnegative linear combination of rows of A; that is, K C hcone (Ax). 
However, the k*® component of A, is zero by construction, and so K C 
hcone (Ax) implies P;, C hcone (Ax). 

Conversely, suppose x € hcone(A;). We need to find A € R such that 
A (x — rex) < 0, that is, 


a4101 +++: + Ain (@_-A) +--+ + a1ata <0, 


Gmi%1 ++++ + Omg (Le — A) +++ +Gmata <0. 


The j*" constraint is aj -x — aj~ < 0, that is, aj -x < aj,A. This gives the 
following conditions on A: 


aj: xX : 
A> : if aj, > 0, 
Qik 
a;:-xX . 
v\< 2 if ajp <0. 
ajk 


Such a A exists, because if aj, > 0 and aj, <0, then (since x € hcone (A,)) 


(Gia; = jk Ai) ‘xX < 0, 
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which is equivalent to 


< ae an 
Qik ajk 
Thus we can find \ that satisfies 
aj;-xX a;-xX 
——<A<——, 
Qik Aik 


which proves hcone (Ax) C Px. 


Hints for & Exercises 


Well here’s another clue for you all. 


John Lennon & Paul McCartney (“Glass Onion,” The White Album) 


Chapter 1 


1.1 Set up the partial fraction expansion as 


Zz _ A é B 
Lag=2 gts ga devey 


and clear denominators to compute A and B; one can do so, for example, by 
specializing z. 


1.2 Multiply out (1 — z) (1 tgtz2+..-4 e*). For the infinite sum, note 
that ling 44..8° = Ot lel I, 


1.3 Start with the observation that there are |x| +1 lattice points in the 
interval [0, x]. 


1.4 (i) & (j) Write n = gm +r for some integers g,r such that 0 < r << m. 
Distinguish the cases r = 0 and r > 0. 


1.9 Use the fact that for m and n relatively prime and a € Z, there exists 
b € Z (which is unique modulo n) such that mb = a (mod n). For the second 
equality of sets, think about the case a = 0. 


1.12 First translate the line segment to the origin and explain why this 
translation leaves the integer-point enumeration invariant. For the case (a, b) = 
(0,0), first study the problem under the restriction that gcd (c,d) = 1. 


1.17 Given a triangle 7 with vertices on the integer lattice, consider the 
parallelogram P formed by two fixed edges of 7. Use integral translates of 
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P to tile the plane R?. Conclude from this tiling that P contains only its 
vertices as lattice points if and only if the area of P is 1. 


1.20 Given an integer b, the Euclidean algorithm asserts the existence 
of m,,M2,...,mq € Z such that b can be represented as b = mya, + 
mag +--+: + mMmgdag. Convince yourself that we can demand that in this 
representation, 0 < m2,m3,...,™mq¢ < a,. Conclude that all integers beyond 
(a, — 1) (ag + a3 +--++ a) are representable in terms of a1, a2,..., aa. (This 
argument can be refined to yield another proof of Theorem 1.2.) 


1.21 Use the setup 
b-1 


A An, An Bi, Be a 
f(2= z T ga eae Veet Galt ge 


To compute C;, multiply both sides by (z _ e*) and calculate the limit as 
z— €*, The coefficients D; can be computed in a similar fashion. 


1.22 Use Exercise 1.9 (with m = b~') on the left-hand side of the equation. 


1.24 Suppose a > b. The integer a+ 6 certainly has a representation in terms 
of a and b, namely, 1-a+1-b. Think about how the coefficient of b would 
change if we changed the coefficient of a. 


1.31 Use the partial fraction setup (1.11), multiply both sides by (z - ae 
and take the limit as z > ae 


1.33 Convince yourself of the generating-function setup 


6 A ge 2% gad 
Sram" = (=) (FR) (-F§)- 
n>1 


Now use the machinery of Section 1.5. 


Chapter 2 


2.1 Use Exercise 1.3 for the closed interval. For open intervals, you can 
use Exercise 1.4(j) or the [...] notation of Exercise 1.4(e). To show the 
quasipolynomial character, rewrite the greatest-integer function in terms of 
the fractional-part function. 


2.2 Write FR as a direct product of two intervals and use Exercise 1.3. 


2.6 Start by showing that the convex hull of a d-element subset W of V is a 
face of A. This allows you to prove the first statement by induction (using 
Exercise 2.5). For the converse statement, given a supporting hyperplane H 
that defines the face F of A, let W C V consist of those vertices of A that 
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are in H. Now prove that every point 
X= Avi + Agave + +++ + Adi Va41 


in F has to satisfy \;, = 0 for all v;, ¢ W. 


2.7 First show that the linear inequalities and equations describing a rational 
polytope can be chosen with rational coefficients, and then clear denominators. 
2.8 

(a) Prove that as rational functions, )7 5.9 j427 = (—1)* Yjso 5% ay 

d_ A(d,k)z* ! 4-1 4(d—1,k)z* 

(b) Use the fact that 2 ale =zi (S435, de ): 

c) Begin by proving one of the remarks in the notes of Chapter 2, namely, 

& & 
that A(d,k) counts the permutations of {1,2,...,d} with k — 1 ascents. 

(d) Use the fact that S\{_9 A (d,k) 2* = (1—z)441 D542. 


: 1 k k k 
2.9 Write Goya = (Dest Zz :) (eker ») vee (Saas z ais) and 
come up with a combinatorial enumeration scheme to compute the coefficients 


of this power series. 


2.10 Write Cy) = GPR) EFR 1 eb h dt) and switch ¢t to —t. 


2.14 Think about the poles of the function <+,; and use a theorem from 


complex analysis. 


I 


gane? 
l—e-?° 


2.15 Compute the generating function of By(1 — x) and rewrite it as 


2.16 Show that =, + hz is an even function of z. 


2.23 Follow the steps of the proof of Theorem 2.4. 


2.25 Extend 7 to a rectangle whose diagonal is the hypotenuse of 7, and 
consider the lattice points on this diagonal separately. 


2.26 For the area use elementary calculus. For the number of boundary 
points on tP, extend Exercise 1.12 to a set of line segments whose union forms 
a simple closed curve. 


2.33 Rewrite the inequality as (| 4] 1) e+ ({2] 1) f <tr and compare 
this with the definition of T. 


2.34 To compute C3, multiply both sides of (2.20) by (z — 3)? and compute 
the limit as z > 1. The coefficients Aj and B; can be computed in a similar 
fashion. To compute C2, first move webs in (2.20) to the left-hand side, 
then multiply by (z — 1)? and take the limit as z > 1. A similar, even more 
elaborate, computation gives C). (Alternatively, compute the Laurent series 
of the function in (2.20) at z = 1 with a computer algebra system such as 
Maple, Mathematica, or Sage.) 
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k = 
2.35 Show that lim, .-+ 1, + & (2—éh)" = a1. 


2.36 Follow the proof of Theorem 2.10. Use Exercise 2.35 to compute the 
additional coefficients in the partial fraction expansion of the generating 
function corresponding to this lattice-point count. 


2.38 Start with computing the constant term of 


1 
(1 — 2129) (1 — z?zq) (1 — 21) (1 — 22) zPez3e 


with respect to z2 by treating z1 as a constant and setting up a partial fraction 
expansion of this function with respect to 22. 


Chapter 3 

3.3 

(a) Convince yourself that if the supporting hyperplane of F is horizontal, 
then h1,h2,...,hn in (3.1) were not chosen randomly. 

(b) Prove that if aa41 > 0, then there cannot exist a point (11, 2,...,%a41) € 
F and an ¢ > 0 such that (x,22,...,%a41 — €) € Q, and if agy1 < 
O and (a1,%2,...,Ua41) © F, then there exists « > O such that 
(11, Xa, see y Lgty — €) €E Q. 


(c) Use the fact that 7, MF. is a face common to both F, and Fo. 
3.5 Write the simplicial cones as cones over simplices and use Exercise 2.6. 


3.8 Write down a typical term of the product 


Og (21, 225-+-52m) OT (Zm4i1; Zm42;+-+)2m-+tn)- 


3.9 Multiply out z™ox(z). 


: : . i. 4 1 =i. a1 =1 
3.10 Write a typical term in og (2, tees 1) =05s (2 SQ yey Sq ) F 
3.13 Given the polynomial f, split up the generating function on the left-hand 
side according to the terms of f and use (2.2). Conversely, if the polynomial 
g is given, use (2.6). 


3.18 Show that HZ? is a Z-module. Therefore, it has a basis; extend this 
basis to a basis of Z%. 


3.19 Think about a small (even irrational) perturbation of all hyperplanes 
in the right direction. 
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3.21 View aa 
ef gts) a 1 
af a ae 
det 
d d-1 
Ta41 Tayi 1 Tati 1 
as a polynomial in x1, considering %2,%3,...,%q41 constants. Show that this 
polynomial has roots at 2, = ®2, ©] = %3, ...., £1 = La41, and compute its 


leading coefficient. 


3.24 Given f, split up the generating function on the left-hand side according 
to the constituents of f; then use Exercise 3.13. Conversely, given g and h, 
multiply both by a polynomial to get the denominator on the right-hand side 
into the form (1 — z”)¢++; then use (2.6). 


3.25 Start with the setup on page 80, and closely orient yourself along the 
proof of Theorem 3.8. 


3.32 Convince yourself that if P has denominator p, then Lp(0) equals the 
constant term of the Ehrhart polynomial of pP. 


3.34 Use Lemma 3.19. 


Chapter 4 


4.1 Use Exercise 2.1. 
4.2 Use the explicit description of II given by (4.3). 


4.4 Consider each simplicial cone K; separately, and look at the arrangement 
of its bounding hyperplanes. For each hyperplane, use Exercise 3.18. 


4.7 For (a), convince yourself that Q(—t) is also a quasipolynomial. For (b), 
use (1.3). For (c), differentiate (1.3). For (d), think about one constituent of 
the quasipolynomial at a time. 


4.8 In the generating function for Lp(t—k), make a change in the summation 
variable; then use Theorem 4.4. 


4.13 Use the fact that A has only integral entries. For the second part, write 
down the explicit hyperplane descriptions of (t + 1)P° and tP. 


4.14 Assume that there exist t € Z and a facet hyperplane H of P such that 
there is a lattice point between tH and (t+ 1)H. Translate this lattice point 
to a lattice point that violates (4.12). 
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Chapter 5 


5.3 To prove the inclusion C (the other inclusion is clear), you need to show 
that every point in P lies in the interior of some face F of P, and that the 
relative interiors of two different faces are disjoint. One can prove both facts 
by looking for a minimal face that contains a given point in P in its relative 
interior. 


5.5 Consider an interval [F,P] in the face lattice of P; namely, [F,P] 
contains all faces G such that F C G C P. Prove that if P is simple, then 
every such interval is isomorphic to a Boolean lattice. 


5.6 Use Exercise 2.6 to show that the face lattice of a simplex is isomorphic 
to a Boolean lattice. 


5.7 Orient yourself along the proof of Theorem 5.3, but start with the Euler 
relation (Theorem 5.2) for a given face F instead of (5.2). 


5.9 Assume that the origin is in the interior of P. 


(a) Given a face F of P, show that 
F* := {x eR’: x-y <1 forally€P, x-y=1 forally € F} 


is a face of P*, and that G C F if and only if G* D> F*. 

(b) Figure out a prominent property of the face lattice of every sim- 
ple/simplicial polytope; then use (a). 

(c) Try a proof that is dual to the one hinted at in Exercise 5.7. 


5.14 


(a) First choose any two of the five points on the sphere, and draw a great 
circle through them. This great circle partitions the sphere into two 
hemispheres, and the union of these two hemispheres now contains three 
of the five given points. Now the pigeonhole principle tells us that (at 
least) two of the five given points must lie in one of these two hemispheres. 

(b) Normalize the five given vectors to have unit length and use part (a) of 
this exercise. 


Chapter 6 


6.1 Think permutation matrices. 
6.3 Show that the rank of (6.5) is 2n — 1. 


6.5 Start by showing that all permutation matrices are indeed vertices. Then 
use Exercise 6.4 to show that there are no other vertices. 
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6.6 Establish a bijection between semimagic squares with line sum t — n and 
semimagic squares with positive entries and line sum t. 


6.7 Think about the smallest possible line sum if the entries of the square 
are positive integers. 


6.8 Follow the computation on page 120 that led to the formula for Ho. 
6.9 Multiply both sides of (6.7) by (w - i) and take the limit as w > re 


6.10 Orient yourself along the computation in (6.10). 


6.16 Compute the matrix equivalent to (6.5) for the polytope describing all 
magic squares of a given size. Show that this matrix has rank 2n + 1. 


6.18 Orient yourself along the computation on page 120. 


Chapter 7 


7.2 Use Exercise 7.1. 
7.5 Differentiate (1.3). 
7.6 Use (1.3). 


7.7 Write an arbitrary function on Z with period 6 in terms of 6,,(), 
l<m<ob. 


7.8 Use the definition (7.6) of the inner product and the properties 2Z = |z|? 
and (zw) = Z-W for complex numbers z and w. 


7.15 Use the definition (7.4) and simplify the fractional-part function in the 
sum on the right-hand side. 


7.23 Use the definition of F. 


Chapter 8 


8.7 Use Exercise 1.9. 


8.9 Use the methods outlined in the hints for Exercises 1.21 and 2.34 to 
compute the partial fraction coefficients for z = 1 in (8.3). 


8.12 Multiply out all the terms on the left-hand side and make use of 
Exercises 1.9 and 7.15. 


8.15 Use the methods outlined in the hints for Exercises 1.21 and 2.34 to 
compute a partial fraction expansion of (8.8). 
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Chapter 9 


9.1 Write out a typical term in p(21, 22,..., 2a) @(21, 22;--+; 2d): 
9.5 Review the definition of Z(uj, u2,...u,) to show that 
TI, Ug U--- UTI, UP) UP2U+ ++ U Pm = Z(u1, U2,... Un) 
and then show that this union is disjoint, using the precise definition of 
the P;’s. 


9.9 Convince yourself that Pg lies in the hyperplane given by the equation 
U+tat-:++2%g= iF and find d— 1 linearly independent vectors among 
those defining Py. To show that v = (m(1) — 1,7(2) —1,...,7(d)—1) isa 
vertex of Py, consider the hyperplane through v with normal vector v. 


Chapter 10 


10.1 Expand the powers of (1 — z) in the definition of hp(z). 


10.2 Assume that P is full-dimensional and consider a facet of S by con- 
structing one of its supporting hyperplanes. Show that this hyperplane can 
contain no more than d vertices of S. To show that the face numbers of S 
equal those of the triangulation of OP, try to construct a bijection. 


10.4 Given the unimodular k-simplex A C R%, construct a bijection 
span(A) + R* that maps A to {x€ R§, 1 Uy +42 +--+ +2, <1} and 
span(A)/M Z? to Z*. 


10.7 Start by showing that if A is a k-simplex and 
Tes {Ai wi + Agwe +--+ Ak+IWk+1 :0< 1, r2, at » AR+41 < 1} 


is the open fundamental parallelepiped of cone(A), then 


T= —I+ wi +wet+--++Weey1- 


10.8 Carefully review the definition of link(A). 


10.9 Begin by rewriting (10.3), collecting terms stemming from simplices ® 
of the same dimension. 


10.10 Begin by proving that 


m(A) = {o}U LJ 1), 


QCA 
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where the union is over all nonempty faces of A, and that this union is disjoint. 


10.11 Build an almost-one-to-one correspondence between the simplices in 
T and those in Tp \ TJ, and see what this correspondence gives about the 
respective h-vectors of links. 


10.12 Establish a one-to-one correspondence between link(A) and the bound- 
ary faces of a polytope of dimension d — dim(A) — 1 respecting the face 
relations. 


10.13 Writing a(z) = agz? + ag_1z4-1 + +--+ a9 and b(z) = bg_az4-1 + 
ba_2z4-? +--+ + bo, give a concrete rule how to compute ao, ag, ba—1, bo, a1, 
da—1, ba—2, --- from the coefficients of p(z). 


10.14 See what your rule from Exercise 10.13 says about the coefficients of 
h5(z) when one takes into account that the a;’s and b,;’s are nonnegative. 


Chapter 11 


11.4 Consider the hyperplanes Hy, H2,...,Ha41 that bound A. For each 
hyperplane H;, denote by H hi the closed half-space bounded by H;, that 
contains A, and by H,; the open half-space bounded by H; that does not 
contain A. Show that every tangent cone of A is the intersection of some 
of the H ni ’s, and conversely, that every intersection of some of the H Ay ’s, 
except for A = fia H,*, is a tangent cone of A. Since Hj} UH, = Rasa 
disjoint union, for each k, the point x is either in H Bd or H,,. Prove that the 
intersection of those H a that contain x is the sought-after tangent cone. 


11.6 As in Exercise 5.6, show that the face lattice of a simplex is a Boolean 
lattice. Note that every sublattice of a Boolean lattice is again Boolean. 


11.8 One approach to this problem is first to dilate P and the corresponding 
hyperplanes in H by a small factor. To avoid subtleties, first translate P by 
an integer vector, if necessary, to ensure that none of the hyperplanes in H 
contains the origin. Use Exercise 3.18. 


11.9 Adjust the steps in Section 11.4 to open polytopes. Start by proving a 
Brianchon—Gram identity for open simplices, by analogy with Theorem 11.5. 
This implies a Brion-type identity for open simplices, as in Corollary 11.6. 
Finally, adjust the proof of Theorem 11.7 to open polytopes. 


Chapter 12 


12.1 Use (12.3), Exercise 2.18, and (2.11). 
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12.2 Review the proof of Theorem 3.5. 


12.3 Use the definition of unimodularity to show that the only integer point 
in the fundamental parallelepiped of K is v. 


12.6 Orient yourself along the proof of Theorem 12.3; instead of a sum over 
vertex cones, just consider one simple cone K. 


Chapter 13 


13.5 Multiply out z™ax(z). 


13.6 Orient yourself along the proof of Theorem 4.2. Note that for solid 
angles, we do not require the condition that the boundary of K contains no 
lattice point. 


13.7 As a warmup exercise, show that 


> > OK, (F)° (Z) = x > F(Z). 


FCA va vertex of F va vertex of A FCKy 
dim F>0 dim F>0 


13.8 Start with the setup of our second proof of Ehrhart’s theorem in 
Section 11.5; that is, it suffices to prove that if p is the denominator of P, then 
Ap(—r — pt) = (—1)%™? Ap(r + pt) for all integers r and t with 0 < r < p 
and t > 0. (Think of r as fixed and ¢ as variable.) Now orient yourself along 
the proof on page 232. 


Chapter 14 


14.1 Bound the integral from above, using the length of C, and an upper 
bound for the absolute value of the integrand. 


14.2 The nontrivial roots of unity are simple poles of f, for which the residue 
computation boils down to a simple limit. 


1 1 p . 
pa (ay and >3;, into one fraction. 


14.5 Start by combining the terms 
14.6 Differentiate (14.1) term by term. 

14.7 Compute g’ explicitly. 

14.8 Use a famous theorem from complex analysis. 


14.9 Compute g (—z) and use the fact that (—(m + in))? = (m+ in)?. 
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14.10 Repeat the proof of Lemma 14.3, but now starting with the proof of 
g! (z+ 4) = p'(z). 
14.11 Use a famous theorem from complex analysis. 


14.12 Use the definition of the Weierstra8 o-function. 
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Dedekind sum, 138, 142, 150, 158 
Dedekind’s reciprocity law, 153 
Dedekind—Rademacher sum, 155 
degenerate hyperplane, 28 
degree 
of a polytope, 94, 192 
of a quasipolynomial, 47 
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degree sequence, 181 
Dehn-Sommerville relations, 101, 110, 184 
Delannoy path, 56 
delta function, 140 
Delzant polytope, 221 
denominator, 80 
dihedral angle, 228 
dilate, 7, 29 
dimension 
of a pointed cone, 63 
of a polytope, 28 
discrete volume, 29, 32, 38, 78, 89, 229 
distance, 140 
doubly periodic function, 243 
doubly stochastic matrix, 116 
dual polytope, 97, 110 


edge 
of a pointed cone, 64 
of a polytope, 28 
Ehrhart polynomial, 68, 72, 94, 105, 158 
constant term of, 74 
interpolation of, 78 
leading coefficient of, 77 
of a zonotope, 176 
second coefficient of, 108 
Ehrhart quasipolynomial, 80 
Ehrhart series, 30, 51, 72, 93, 159, 183 
Ehrhart’s theorem, 68, 80, 117 
Ehrhart—Macdonald reciprocity, 90, 102, 
117, 155 
empty simplex, 187 
Erdés—Gallai theorem, 181 
Euclidean algorithm, 25, 153 
Euler ¢-function, 147 
Euler characteristic, 82 
Euler relation, 102 
Euler—Maclaurin summation, 213 
Eulerian number, 30, 51 
exponential function, 214 


face 
lattice, 102 
number, 101 
of a pointed cone, 64 
of a polytope, 28 
visible, 210 
facet 
of a pointed cone, 64 
of a polytope, 28 
visible, 196 
Farey sequence, 56 
Fibonacci sequence, 3, 162 
finite Fourier series, 133 
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finite Fourier transform, 139 

finite geometric series, 19, 141, 199 
forest, 174 

Fourier coefficient, 136 

Fourier transform, 139 
Fourier—Dedekind sum, 15, 149 
Fourier—Laplace transform, 218 
fractional-part function, 10, 20, 137 
free sum, 97 

frequency, 137 

Frobenius number, 6 

Frobenius problem, 3 

fundamental parallelepiped, 66, 187, 217 


Gau8 sum, 148 
gcd algorithm, 153 
generalized permutahedron, 178 
generating function, 3, 8, 14, 30, 65, 68, 
119 
constant term of, 8, 14, 45, 124, 158 
multivariate, 49 
rational, 64, 199 
generator of a pointed cone, 63 
geometric series, 4, 19, 66, 134 
geometry of numbers, 181 
Gorenstein polytope, 98 
graded algebra, 51 
graph, 102, 129, 174, 196 
graphical zonotope, 181 
greatest-integer function, 10, 19, 89, 137 


h-cone, 249 
h*-polynomial, 72, 183 
half-space, 28 
hemisphere, 178 
Hibi’s palindromic theorem, 95 
Hilbert—Poincaré series, 51 
hyperplane, 28, 63 
degenerate, 28 
rational, 85 
hyperplane arrangement, 85, 180, 201 
central, 201 
rational, 201 
hyperplane description, 28, 51, 249 
hypersimplex, 85 


indicator function, 205 

induced triangulation, 184 

infinite sequence, 3 

inner product, 140, 146 

integer decomposition property, 192 

integer-point transform, 64, 69, 90, 199 
of a simplicial cone, 66 

integral polygon, 40 
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integral polytope, 28, 68, 108, 236 order polytope, 51 
integrally closed, 192 orthogonal complement, 210 
interior, 30, 89, 93 orthogonality relations, 141 
relative, 102, 117, 228 
interpolation, 77, 117 palindromicity 
interval, 27 of the Ehrhart series of a reflexive 
isolated singularity, 241 polytope, 95 
of the Ehrhart series of the Birkhoff—von 
Jacobi symbol, 160 Neumann polytope, 117 
of the solid-angle generating function, 
k-skeleton, 103 236 
Khovanskii—Pukhlikov theorem, 221 parallelepiped, 66, 167 
fundamental, 66, 187 
abeled degree sequence, 181 Parseval identity, 141 
Lagrange interpolation formula, 78 part of a partition, 7 
Laguerre polynomial, 52 partial fractions, 4, 8, 14, 45, 134, 151, 158 
Latin square, 126 partition, 7 
attice, 7, 27, 81, 246 period, 47, 80 
attice basis, 22 periodic function, 47, 133, 139 
attice point, 7, 27, 35, 64 periodic sequence, 135 
attice-point enumerator, 29, 48 permutahedron, 172 
Laurent monomial, 64 generalized, 178 
Laurent series, 8, 241 permutation matrix, 116 
ine segment, 12, 21, 28, 89, 167, 199 perturbation, 221 
ine shelling, 196 Pick’s theorem, 40, 245 
inear Diophantine problem of Frobenius, 6 —Plancherel theorem, 141 
inear map, 84, 202 pointed cone, 62, 201 
ink, 187 rational, 62 
ocal Riemann hypothesis, 52 polar polytope, 97, 110 
ower face, 60 polygon, 27 
ower hull, 60, 184 integral, 40 
Luo Shu square, 114 rational, 43, 47 
polyhedron, 28, 201 
magic square, 114 polynomial, 30, 33, 36, 42, 68, 75, 77, 84, 
traditional, 114 108, 117, 145, 199 
magic sum, 113 interpolation of, 78, 117 
Mahler’s conjecture, 100 polytope, 7, 27, 249 
matrix centrally symmetric, 170 
doubly stochastic, 116 Delzant, 221 
unimodular, 88, 181 dual, 97, 110 
matroid, 179 Gorenstein, 98 
metric space, 140 integral, 28, 68, 108, 236 
Minkowski sum, 167 integrally closed, 192 
Minkowski’s fundamental theorem, 181 Newton, 167 
Mobius function, 147 polar, 97, 110 
module, 201 rational, 28, 48, 80, 230 
moment, 218 simple, 101, 196, 216 
moment generating function, 65 simplicial, 110, 183 
monomial moment, 218 smooth, 221 
unimodular, 221 
Newton polytope, 167 power series, 8 
northeast lattice path, 22 primitive root of unity, 147 


pyramid, 35, 51, 235 
octahedron, 38 over a polytope, 36 
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quasipolynomial, 47, 80, 86, 124, 230 
constituent of, 47, 125 
degree of, 47 
period of, 47, 80 


Rademacher reciprocity, 155 
rational cone, 201 
rational generating function, 64, 199 
short, 209 
rational pointed cone, 62 
rational polygon, 43, 47 
rational polytope, 28, 48, 80, 230 
reciprocity law 
for the Carlitz polynomial, 164 
for the classical Dedekind sum, 153 


for the Dedekind—Rademacher sum, 155 
for the Fourier—Dedekind sum, 152, 154 


reciprocity theorem 
for integer-point transforms, 92 
for lattice-point enumeration, 90 


for solid-angle generating functions, 231 


for solid-angle sums, 232 
regular tetrahedron, 54, 238 
regular triangulation, 60, 184 
relative interior, 102, 117, 228 
relative volume, 107 
representable, 6 
residue, 241 
residue theorem, 242 
restricted partition function, 6, 14, 151 

polynomial part of, 151, 247 
rhombic dodecahedron, 169 
Riemann zeta function, 34 
root, 75 
root of unity, 8 

primitive, 147 


sawtooth function, 137, 156 
semigroup, 84 
semimagic square, 114 
symmetric, 128 
shelling, 109, 195 
line, 196 
short rational generating function, 209 
simple polytope, 101, 196, 216 
simplex, 28, 59, 80 
empty, 187 
faces of, 53 
standard, 31, 185, 228 
unimodular, 185 
simplicial 
cone, 63 
polytope, 110, 183 
skeleton, 103 


slack variable, 32, 44, 48 
Smith normal form, 176 
smooth polytope, 221 
solid angle, 227 
generating function, 231 
of a face, 228 
polynomial, 234 
series, 236 
sum of, 229 
standard simplex, 31, 185, 228 
Stanley’s nonnegativity theorem, 72 
Stanley’s reciprocity theorem, 92 
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Stirling number of the first kind, 33, 54 


sublattice, 107 

supporting hyperplane, 28, 63 
Sylvester’s theorem, 6 

symmetric about the origin, 170 
symmetric semimagic square, 128 


tangent cone, 204 
tetrahedron, 28, 158 
Todd operator, 214, 221 
toric variety, 51, 224 
traditional magic square, 114 
transportation polytope, 126 
tree, 176 
triangle, 22, 28, 43 
triangulation 

boundary, 184 

induced, 184 

of a pointed cone, 64 

of a polygon, 41 

of a polytope, 59 

regular, 60, 184 

unimodular, 185 
trigonometric identities, 144 
trivial, 123 
Tutte polynomial, 179 


unimodal, 127, 197 
unimodular 
cone, 221 
group, 88 
matrix, 88 
polytope, 221 
simplex, 185 
triangulation, 185 
unit cube, 29, 35, 169 
unit vector, 31 
unitary transformation, 146 
unlabeled degree sequence, 181 


v-cone, 249 
valuation, 237 
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Vandermonde matrix, 85, 173 

vector partition function, 52 

vector space, 139 

vertex, 28, 127 

vertex cone, 204 

vertex description, 27, 51, 249 

vertex figure, 184 

visible, 60, 196, 210 

volume, 29, 76, 89, 116, 221, 230 
relative, 106 
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Weierstra8 g-function, 243 
Weierstra8 ¢-function, 243 


Zagier reciprocity, 152 
zero, 234 
zonotopal decomposition, 171 
zonotope, 169 
graphical, 181 
history, 178 


